Ref 

# 


Hits 


Search Query 


DBS 


Default 
Operator 


Plurals 


Time Stamp 


SI !' 


=2. 


; (pa ra I lel$ 5 With H^sh^3 ; ; ; 
(join$3 or merg$3 or c6mbin$5)).:;; 
ab. and (@ad<"20010426 M ) =. :|U|==| 


US-PGPUB;: 
: USPAT; ; : 
IBMTDB ; 


ok?: 


ON ! ; 


12005/02/16 20:33.: 


S2 


78 


(hash$3 with (join$3 or merg$3 or 
combin$5)).ab. and 
(((paa< zuuiu^zb ; 


US-PGPUB; 
USPAT; 

TDM THD 
lbl V l_ I Ub 


OR 


ON 


2004/04/22 13:01 




.;** ::: '|v46 


•equijoiri;andi(@ad<?70010426 M ):;;li 


US-PGPUB; 
USPAT; ; . 


jORliPj / 


; ON 


:20b3/657l8]()^:Sl 








IBM TDB 








S4 


0 


(online with aggregat$3 with 
adapt$3 with (quer$5 or 
search$3)).ab. and 
(@ad<"20010426") 


U5-PGPUB, 

USPAT; 

IBM.TDB 


UK 


UN 


^UUJ/Ua/lo ■J/.bo 


S5 - 




(ohlinelwith aggregates with ; 
! (quer$5 or search$3)).ab. and 
I (@ad< , 20010426 ,, >^^^ 


US-PGPUB; 

!!!U : SlAl;!ilil: 
:IBM_TDB 


OR 




:2003/05/18 08:00 


S6 


0 


(online adj2 aggregat$3 adj2 
(quer$5 or search$3)) and 
(@ad< n 20010426 n ) 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2003/05/18 08:09 


S7 : 


liiiiiHi 


(parallel$3 with (hash adj ripple 
i adj join)) and (@ad< n 20010426 H ) I 


US-PGPUB; 
USPAT; 
IBM TDB 


OR 


ON : ; 


2003/05/18 08:10 


S8 


0 


(hash adj ripple adj join) and 
(@ad<"20010426") 


US-PGPUB; 

USPAT; 

IBMTDB 


OR 


ON 


2004/04/22 13:02 


S9 :; : ; 




(hash with join) same (partition$3 
with tuple) and i 
(@ad< n 20010426 M ) i 


US-PGPUB; 

USPAT;:^!! 
IBMJTDBj:: 


OR ' 


ON 


2003/05/18 08:52 


S10 


30 


(hash$3 with (join$3 or merg$3 or 
combin$5)).ab. and 
(@ad<"20010426") and 
partition$3 


I IC D^DI ID* 

Ub-rtjrUD, 

USPAT; 

IBM.TDB 


UK 




»cUUj/J.1/1t uu.^ 


sii 


38 


(((equivalent or hash$3) with • ; ; 
(jbiri$3 or merg$3 or combin$5)) :i ^ 
i with (processors or processors)) 
and partition$3 and; 
(@ad< M 20010426 M ): 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2003/11/14 14:22 


S12 


10 


(((equivalent or hash$3) with 
(join$3 or merg$3 or combin$5)) 
with (distribut$3 or redistribut$3) 
with (partition$3 or divid$3)) and 
(@ad<"20010426 M ) 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2005/02/14 20:23 


■'S;13;-; 


; :;;; : ;; :.:;;:i55: 


(hash with join$3) and 
': : (@ad<'!2p6l6426")^ 


; US-PGPUB; 

iUSPAT;::;!;:: 
: IBMi TDB 


OR ;• 


ON : ] ; 


:!2004/04722 13:06 
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^ 1 Idol 1 WIUI JUIM^J/ allVJ 

(aggregat$5 or (intermed$9 with 
result)) and f@ad<"20010426 ,r > 


l K-PftPI IR* 
Uj T Or UD/ 

USPAT; 
IBM TDB 


no 




2004/04/22 13:14 


;S.5]| 




; (hash with jpi n$3 ) a nd r(((ph 1 i he or i ; 


llS-PGPUB;: 1 


OR 1 : I 


ON: : : 


2004/04/22 16:20 1 






;;i(oniacU;;jine)^ 


-USPAT; ; . 
























(ihterm^j$9 with result)) arid = 










S16 


0 


(hash with join$3) and (split adj 

VcClUlJ dno mUNIIHc UF dUJ 

line)) adj aggregat$5) or 
(intermed$9 with result)) and 
(@ad< 20010426 ) 


US-PGPUB; 

1 IQDAT* 

IBM_TDB 


OR 


ON 


2004/04/22 13:19 


S17 : 


liiiiiiiiiiii 


(hash with join$3) and (split adj 
ve^r);and;(@ad<^20010426 n ) 


US-PGPUB; : l 
USPAT; 
IBM.TDB :!\: 


■ii!!!! 


ON 


2004/04/22 13:19!! 


S18 


15 


fha^h with ioin43) and vector and 
(((online or (on adj line)) adj 
aggregate oj or ^intermeo^y witn 

icbUllJj dMU ^vyJdU^ ^UUIUt^D J 


US-PGPUB- 
USPAT; 

TDM TPiD 
lDl v l_ 1 UD 


OR 


ON 


2004/04/22 13-23 


S19 


iiiiiiiiiiiiilii 


(hash with jbin$3) and vector and • 
(@ad<"20010426") 


US-PGPUB; 

^USPAf^i^i;;^ 


OR : 


ON : 


2004/04/22 14:24 








llBMpiDB;!!;;; 








S20 


0 


(hash with join$3) and (display$3 

Willi mUNIINc Ul ^Ull dUJ IIMcJJ dUJ 

aggregat$5) or (intermed$9 with 
result))) and ((g)ad< 2UUlU42b ) 


US-PGPUB; 

1 I^PAT 1 
UDrM 1 f 

IBM_TDB 


OR 


ON 


2004/04/29 07:49 


S21 ; 




: "5884320".pn. ; ■' 


!US-PGPUB; 


OR 


ON 


2004/04/29 07:46 










































S22 


35 


((join or joining) with simultan$9 
with (distribut$3 or redistribut$3)) 
and (@ad<"20010426") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/04/29 07:55 


S23 


503 


((join orjoining) with (simultan$9 
or "as" or "while") with 
(distribut$3 or redistribut$3)) and 
: (@ad< n 2b010426 n ) 


us-pgpub;;: 

iiiUSPAt;:!!:;!;;;;!: 
iBMTDB; 


OR 


ON 


2004/04/29 08:00 














S24 


81 


(((join orjoining) with (simultan$9 

ii ii ii i*i ti\\ 

or as or "while )) same 
(distribut$3 or redistribut$3)) and 
(parallel$3 with process$3) and 
(@BQ< 2UUlU42b ) 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/04/29 08:03 


S25 


1121 


(hash$3 with (jpin$3 or merg$3 or 
combin$5» and 


US-PGPUB; 
USPAT; 


OR 


ON 


2005/02/14 20:28 






:(@ad<"20010426") I; ! - 


: IBM TDB 








S26 


26 


(hash$3 with (join$3 or merg$3 or 
combin$5)) and ((join orjoining or 
merg$3 or combin$3) with 
(simultan$9 or "as" or "while") 
with (distribut$3 or redistribut$3)) 
and (@ad<"20010426") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2005/02/14 20:26 
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C77 




^nasn^j wiin yoin^j or merg^j 
or combin$5)) same (aggregat$5 
or (intermed$9 with result))) and 
(@ad<"20010426") 


1 IC D(ZO\ ID- 

USPAT; 
IBM_TDB 


no 

UK 


UIN 


"5nnc/n*5/i4 *?no7 

ZUUD/UZ/It IXj.CI 




: 1 1 .• 


: ;^ndsn$a ; :Witn^ 

or combin$5)) same (((bnlirielor 
(on adj line)) adj aggregates) or 


:::|:|C ^Df*DI ID* <■ 

USPAT; 
IBM_TDB : 


. UK ' ; 


UN ;: 


: : o nn c /no / 1 4 : : *) n i 1 7 ■ 

:.2yUb/yZ/14 l\J.£/ 






:(intermecl^9: With result))) and 




























(@ad< !, 20010426 M ) 










b^y 




v nasn$j witn (jom$o or merg$^ or 

rnmhin*fc""i^ anH ^cnlif'fc^ wit"h 

\~\J\\\\J\\\yD } } CIIIU ^0|JIII^>J WIUI 

(vector or table or matrix)) and 
(@ad<"20010426") 


1 IC D^DI ID» 

USPAT' 

TDM TPkD 


no 

UK 


UIN 


zuuo/uz/It zu.zy 


S3Q: ; ; 


• 2 


: (hash$3 With (jbin$3 6r nri£rg$3 or 


US-PGPUB; 


;0R;;; 


ON 


2005/02/14 20:29 






combin$5)) and ((distribut$3 or 


USPAT; : : : ; 












i:redistiibut$3)i With Split$5Hwithi:n : li 


IBM TDB 1: i 
























'. (vector or table or matrix) 














(@ad<"20010426") : - 










Sll 


2 


"6804678" on "6745198" Dn 


US-PGPUB 1 

USPAT; 

IBM_TDB 


OR 


ON 


2005/02/17 14'20 


;S32: V : 




;^49300?2lpri; :i'5551<)3l"jpin; 


US-PGPUB; ; 


:iQR^:i;i: : : ::': 




2005/02/17 18:06:: 






j " : 555779i;;.pn; "5745896".pn. 


: USPAT;! 












! , 5832475 M .ph^ "6032144" :pn. , ' 
"6061676 ! !.pn. "6081801".pn. 
"6112198".pn. "6205441".pn. 

■ " il ^ "1 ^C'C ~>C\\\\ : — — ' : MCA'i nmil : ::::::::::::: 

6226639 .pn. : 6415297 .pn. 


IBM :TDB :::: 












j"6484159"jpn. "6493701".pn. : ! ■■. 
"6618719".pn; "6625593".pn. 

"22026103793" 










S33 


7 


("4930072".pn. "5551031".pn. 
"5557791".pn. "5745896".pn. 
"5832475".pn. "6032144".pn. 
"6061676".pn. "6081801".pn. 
"6112198".pn. "6205441".pn. 
"6226639".pn. "6415297".pn. 
"64841 59". pn. "6493701".pn. 
oolo/iy .pn. ooZdd^d .pn. 
"22020103793") and fhash$3 with 
join$3) and (distribut$3 or 
redistribut$3) and partition$3 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2005/02/17 16:18 




^ Q 


("4930072".pn. ,, 5551031 M .pn. 


US-PGPUB 


OR 

-..yly...-. 


ON ^ ! 


2005/02/17 16-07 ^ 






I'qqRTVQi !!;hh "^74^RQfi":hn 

:: DDD f f 13 1 .*\J\\* J /:TT; J.O «?,0 . |JII. 


l UJrtt l y :: : : : 
:::IBM^TDB ; 












,, 5832475 , R : pn^"6032144 i| vpn. : .. 












"6061676".pn. "608i801".pn. - 
; "61 12198".pn. "6205441".ph. = " : : ' : 


























; "6226639";pn. !!6415297 M :pn. : : !: : 














b4OTiby vpn^ b4yj/ui .pn; : 














H 6618719"vpn. "6625593 ! !.pn.: : ; 
"22620103793") and 
intermediat$3 .; 











Search History 2/17/05 6:46:19 PM Page 3 
C:\APPS\EAST\Workspaces\09842991-l.wsp 



S35 


1 


("4930072".pn. "5551031".pn. 
"5557791".pn. "5745896".pn. 
"5832475".pn. "6032144".pn. 
"6061676".pn. "6081801".pn. 
"6112198". pn. "6205441".pn. 
"6226639".pn. "6415297".pn. 
"6484159".pn. "6493701".pn. 
"6618719".pn. "6625593".pn. 
22020103793 ) and (hash$3 with 
join$3) and ((dynamic$5 with 
distribut$3) or redistribut$3) and 
partition$3 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2005/02/17 16:18 
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r*: > i'/':JAPIO Nov 1 976-2003/Dgc ( Updated 040402) ' 

(c) 2004 JPO & JAPIO 
File 350:Derwent WPIX 1963-2004/UD, UM &UP=200426 

(c) 2004 Thomson Derwent 

Set Items Description 

51 13903 (JOIN??? OR EQUIJOIN???) (7N) {ROW? ? OR TUPLE? ? OR COLUMN? 

? OR FIELD? ? OR CELL? ? OR TABLE? ?) 

52 1295 S1(7N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARAL- 

LEL OR SYNCHRONI2? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR 
SAME () TIME OR AS OR WHILE) 

53 4 734 90 DATABASE? ? OR DATA {) BASE? ? OR TABLE? ? 

54 204 S2 AND S3 
^3^->Sf . y 2"4 4 S4 AND IC=G06F 

56 "'363073 ROW? ? OR TUPLE? ? OR COLUMN? ? 

57 18246 S6(5N) (REDISTRIBUT? OR DISTRIBUT? OR SHUFFL? OR RESHUFFL? - 

OR SHIFT??? OR RESHIFT??? OR MOV??? OR MIX??? OR REORDER??? OR 

SORT??? OR RESORT??? OR REARRANG? OR REORGANI? OR REGROUP??? 
OR RE() (ARRANG? OR ORG AN I? OR GROUP??? OR ORDER?)) 

58 2645 SK25N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARA- 

LLEL OR SYNCHRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC - 
OR SAME { ) TIME OR AS OR WHILE) 

59 433 SI AND S8 AND S3 

510 6 S9 AND S7 

511 S.10 NOT S5 

17837 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (20N) (S- 
IMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARALLEL OR SYNC- 
HRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR SAME ( ) TIM- 
E) 

SI3 23 S12 AND S3 AND IC=G06F 

JS14;. - 17. # S13 NOT (S5 OR Sll) 



5/5/1 (Item 1 from file: 347) 

DIALOG (R) File 347: JAPIO 
■ (c) 2004 JPO & JAPIO. All rts. reserv. 

""Image available** 
,•01 NT BODY RESEARCH SUPPORT SYSTEM 

PUB. NO. : 2002-132873 [JP 2002132873 A] 

PUBLISHED: May 10, 2002 (20020510) 

INVENTOR(S): BEPPU FUSAO 

APPLICANT(S) : AINEKKUSU KK 

APPL. NO. : 2000-358501 [JP 2000358501] 

FILED: October 20, 2000 (20001020) 

INTL CLASS: G06F-017/60 ; G06F-019/00 

ABSTRACT 

PROBLEM TO BE SOLVED: To provide a joint body research support system which 
ran improve the reliability of a result body to be delivered to an electric 
:.-ov;er company, reduce operation for entry into a joint facility research 
table by a field operator making a research into joint bodies, and 
*:.or:en the research time. 

SOLUTION: An area power distribution block database is generated by 
inputting joint body information based upon the contents of a contract with 
a requester for the installation of joint bodies by area power distribution 
blocks as objects of research and the joint body facility research 
table is generated which is used to make a previous research of joint 
body articles, inspect differences from an actual state after the field 
research is completed, totalize information of contents different from 
joinc body information based upon the contract with the mentioned 
rori'jesr.er, and generate the result body by inputting information generated 
judging a joint cable of a joint body found by the field research by a 
.■.v-.rcvir.icacion cable external-diameter measurement system and entering all 
r o search results. 

COPYRIGHT : (C) 2002, JPO 



5/5/2 (Item 2 from file: 347) 

DIALOG (R) File 347: JAPIO 
(c) 2004 JPO & JAPIO. All rts. reserv. 

• 04062383 ++ Image available** 
DATABASE RETRIEVAL SYSTEM 

0!)-0540S3 [JP 5054083 A] 
' . :/M*:::: March 05, 1993 (19930305) 
V \s* : ICHIYAMA TOSHIHARU 

■ i '"ANT { s ) : NEC CORP [000423] (A Japanese Company or Corporation), JP 
(Japan) 

APPL . NO. : 03-235452 [JP 91235452] 

FILED: August 21, 1991 (19910821) 

INTL CLASS: [5] G06F-015/40 ; G06F-012/00 

JAPIO CLASS: 4 5.4 (INFORMATION PROCESSING — Computer Applications); 4 5.2 
(INFORMATION PROCESSING — Memory Units) 

-Jdy^-NAJr: Sect-ion: -P— Sec ti on -Not -15-71,- Vol— r? — Not "3 6 67 "Pg.-~ ITT JuTy " 

09, 1993 (19930709) 

ABSTRACT 

i'i:ki-Y>SE: To automatically extract the joint information between tables 
without; any leaking even in the database composed of many tables by 
retrieving the database only by describing the conditions concerning the 
column of respective tables even when a user does not have the knowledge 
concerning the structure of the database which is a retrieval object and 
the detailed knowlledge concerning the joint information between the 
tables . 



CONSTITUTION: This system has a joint information holding means 4 to hold 



rh*» joint information between tables of a database , a joint conditions 
imperting means 1 to obtain the joint information from the joint 
information holding means 4 and output a retrieval expression to add the 

joint information with the table retrieval expression as an input and 
a database control means 2 to perform the retrieval of a database 
storage means 3 while the retrieval expression is received, and has a joint 
information extraction means 5 to extract the joint informaiton between the 

tables of the database storage means 3 from the database control 
means 2 and send the extracted joint information to the joint information 
holding means 4 . 



5/5/3 (Item 3 from file: 347) 

■*•■:.• 54 /: JAPIO 

; * .iAPIO. All rus. reserv. 



'. '\ .<c?61 ' 'Image available** 
OUTER JOINING OPERATION SYSTEM OF RELATIONAL DATA BASE 



PUB. NO. : 59-125461 [JP 59125461 A] 

PUBLISHED: July 19, 1984 (19840719) 
INVENTOR(s): TEZUKA MASAYOSHI 

ADACHI SUSUMU 

NAKADA TERUO 

YAMANE YASUO 

MAKINOUCHI AKIFUMI 

APPLICANT { s ) : FUJITSU LTD [000522] (A Japanese Company or Corporation), JP 
(Japan ) 

AFt'L. NO. : 57-234066 [JP 82234066] 

f-'i LED: December 30, 1982 (19821230) 

iNTL CLASS: [3] G06F-013/00 ; G06F-007/22 

JAPIO CLASS: 4 5.2 (INFORMATION PROCESSING — Memory Units); 4 5.1 

(INFORMATION PROCESSING — Arithmetic Sequence Units) 

JOURNAL: Section: P, Section No. 315, Vol. 08, No. 254, Pg. 87, 

November 21, 1984 (19841121) 



ABSTRACT 

PURPOSE: To realize a high-speed processing system by scanning 
simultaneously relation to be joined together, comparing to decide 
■ ' ' <! join field value is larger, smaller, or is equal, etc., to 
*: : :• joining, and to produce the relation of result. 

:*.. '*: ;' : 'T ! ON : A relational data base processing mechanism 1 accepts an 
ry from a user and passes it to a relational operation control part 2. 
The relational operation control part 2 checks the adequacy of the inquiry. 
An execution control part 6 sorts and extracts relation operation to 
determine its execution procedure, which is passed to an interpretation 
execution part 3 and executed. Then, an outer joining execution part 10 
performs outer joining processing. In outer join operation processing, the 
values of joint field are extracted and compared successively from the 
starting record of plural sort relation to generate outer joining result 
responding to the result decided to be larger, smaller or is equal, and the 
result is inserted into the relation of results. 



~5/5V~4 "(Item~l"~ffbm~f Tie" "350) ~ 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

015892836 **Image available** 

WPI Acc No: 2004-050671/200405 

XRPX Acc No: N04-040933 

Query optimization method in relational database system, involves 
joining pair of tables having respective primary and foreign key 
columns using specified join condition, and generating derived date 
constraint rule 

P^r^n: Assignee: NCR CORP { NATC ) 

':.v-r::or: GHAZAL A S; SINCLAIR P L 



Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6662175 Bl 20031209 US 2001850765 A 20010508 200405 B 

Priority Applications (No Type Date): US 2001850765 A 20010508 
Patenc Details : 

No Kind Lan Pg Main IPC Filing Notes 

'.<'.o;::* : t> ei is GO6F-017/30 

Ahsi.racc (Basic): US 6662175 Bl 

NOVELTY - A pair of tables having respective primary key (PK) and 
foreign key (FK) columns are joined , using PK is equal to FK as 
join condition. An initial running constraint (RC) having null range is 
created. A derived date constraint rule (DDCR) is generated, based on 
correlated value columns of join result. The RC is modified by merging 
range of RC with range of new constraint that is computed for each row 
in join result. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
computer program for optimizing queries to database . 

USE - For optimizing queries in relational database system such 
as ceradata active data warehousing system and other database systems 
r.avlng massively parallel processing (MPP) architecture or symmetric 
■iiliIl iprocessing (SMP) architecture . 

ADVANTAGE - Automatically derives constraint rules for correlated 
variables, and reduces the size of intermediate result. 

DESCRIPTION OF DRAWING (S) - The figure shows a flowchart of 
algorithm for deriving and applying derived date constraint rule. 

pp; 15 DwgNo 4/9 

Title Terms: QUERY; OPTIMUM; METHOD; RELATED; DATABASE ; SYSTEM; JOIN; 

PAIR; TABLE ; RESPECTIVE; PRIMARY; FOREIGN; KEY; COLUMN; SPECIFIED; JOIN 
; CONDITION; GENERATE; DERIVATIVE; DATE; CONSTRAIN; RULE 

Derwent Class: T01 

International Patent Class (Main): G06F-017/30 

; : 'L*o Segment: EPI 



5/5/5 (Item 2 from file: 350) 

DIALOG ( R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

015834435 **Image available** 
WPI Acc No: 2003-896639/200382 

Related WPI Acc No: 2001-496729; 2003-660928; 2003-897748 

XRPX Acc No: N03-715546 

Data tubes populate method for data management system, involves joining 
columns or domains of table with dimensions and other relations mapped 
into hyper cube to populated hyper cube 

Parent Assignee: DECODE GENETICS EHF {DECO-N) 

i.-.vfsntor: EGILSSON A S; GU.DBJARTSSON H 

Uij::i;jc,vr of Countries: 001 Number of Patents: 001 

r 6;-or.c Family: 

! J aLenc No Kind Date Applicat No Kind Date Week 

US 20030023608 Al 20030130 US 99475436 A 19991230 200382 B 

US 2002216670 A 20020808 



Priority Applications {No Type Date): US 2002216670 A 20020808; US 99475436 

A 19991230 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

US 20030023608 Al 27 G06F-007/00 CIP of application US 99475436 

CIP of patent US 6434557 

At s:.i'j-;r. (Basic): US 20030023608 Al 

NOVELTY - The method involves representing a calculated relation 
as a table supported by columns or domains and joining columns 
or domains of the table with dimensions and other relations mapped 
into a hypercube. The new relations are created from existing relations 



and table -like representations of calculated relations. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
cube populace syscem. 

- i ; or daca management system. 
ADVANTAGE - Eliminates ambiguities observed in combined 
Measurements used to populate a hypercube. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of the 
data tube populate system. 

data tube populate system (100) 

database (101) 
online analytic processing server (102) 
online analytic processing client (103) 
network (104) 
pp; 27 DwgNo 1/14 

Title Terms: DATA; TUBE; METHOD; DATA; MANAGEMENT; SYSTEM; JOIN; COLUMN ; 
DOMAIN; TABLE ; DIMENSION; RELATED; MAP; POPULATION 

: w^;;i- Class : T01 

r i*r.h\ Patent Class (Main): G06F-007/00 
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DIALOG ( R) File 350:Derwent WPIX 
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015397411 **Image available** 

WPI Acc No: 2003-459551/200344 

XRAM Acc No: C03-122363 

XRPX Acc No: N03-365537 

Safety busbar system, especially for a tablet production machine 
comprises system elements which are jointly capable of simultaneous 
implementation of safety, control and measurement functions 

•/stent Assignee: KORSCH AG (KORS-N) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

DE 20304950 Ul 20030528 DE 2003U2004950 U 20030321 200344 B 

Priority Applications (No Type Date): DE 2003U2004950 U 20030321 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
DE 20304950 Ul 14 G06F-013/38 

M:rfr (Basic): DE 20304950 Ul 

NOVELTY - Safety busbar system (10), especially for a tablet 
; r .h... ion machine (12) comprises at least one module (14) compatible 
wiv.h the busbar system and/or at least one busbar controller (16) and 
«t least one busbar conductor (18) . The system ensures essentially 
simultaneous implementation of at least one safety function, at least 
one control function and/or at least one measurement function. 

USE - Used as a safety system for a tablet production machine. 

ADVANTAGE - The amount of wiring and the general system complexity 
are reduced together with fault hazards. 

DESCRIPTION OF DRAWING (S) - The drawing shows a schematic view of 
the proposed safety busbar system. 
Tablets —production- ma chine -(-12-) ~ 

Module (14) 

Busbar controller (16) 
Busbar conductor (18) 
Actuator (20) 
Sensor (22) 

Emergency disconnector switch (24) 
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Document mining method involves removing instance of dirty text within 
document to produce clean document and then performing data mining 
operation 

: : <•:■ Assignee: CASTELLANOS M (CAST- 1);. STINGER J R (STIN-I) 

--.or: CASTELLANOS M; STINGER J R 
dumber of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030046263 Al 20030306 US 2001944919 A 20010831 200341 B 

Priority Applications (No Type Date): US 2001944919 A 20010831 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030046263 Al 13 G06F-007/00 

Abstract (Basic): US 20030046263 Al 

NOVELTY - An instance of dirty text within the document, is removed 
to produce a clean document. Data mining operation is then performed on 
the clean document. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) computer system; and 

(2) computer usable recorded medium storing document mining 
program. 

USE - For mining document containing dirty text such as 
typographical errors, misspellings, joined words, ad hoc 
abbreviations, bad grammar, cryptic tables , programming code, core 
rvjmps, missing or ambiguous punctuation and haphazard capitalization. 

ADVANTAGE - Allows the user to leverage existing domain knowledge 

'jricibles easy customization of document containing dirty text 
•v:co::i L nq to the domain and task requirements. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
i he document mining system. 
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Replication system for database 

in several master tables using preset data in the tables and 
generates replica table by replicating the matched data items 

Patent Assignee: YOKOUCHI H (YOKO-I) 
:nvenior: YOKOUCHI H 

ri-jiiibf-r of Countries: 001 Number of Patents: 001 

I !*»*:.: Fhml J y : 

: M' Kind Date Applicat No Kind Date Week 

; 3:716 Al 20020711 US 2001809257 A 20010316 200267 B 



management in firm, matches data items 



US 2002101065 A 20020320 



Priority Applications (No Type Date) : JP 2000294551 A 20000927 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

<J5i 20020091716 Al 14 G06F-007/00 Cont of application US 2001809257 

Abstract (Basic): US 20020091716 Al 

NOVELTY - The data items in several master tables (108,109) are 
matched using preset data in the master tables as a joining key. 
A replica table (121) is generated by replicating the matched data 
items of the master tables . 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for 
replication program. 

USE - Replication system for database including salary details in 
a firm. 

ADVANTAGE - As the data items of several master tables are 
replicated and included in a single replica table , job execution time 
of the replica system, is reduced, thus efficiency is improved. 

DESCRIPTION OF DRAWING (S) - The figure shows the data flow in the 
replication system. 

Master tables (108,109) 

Replica table (121) 
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Data replication system for employee management system, matches data 
items of several master tables and replicates matched data items to 
generate replica table using particular data in master table 

Patent Assignee: HITACHI LTD (HITA ); YOKOUCHI H (YOKO-I) 

Inventor: YOKOUCHI H 

Number of Countries: 002 Number of Patents: 002 

: ' r • ::i Kami 1 y : 

r • \- '< Kind Date Applicat No Kind Date Week 

ai 20020328 us 2001809257 a 20010316 200245 b 

/i.'-^ol A 20020412 JP 2000294551 A 20000927 200245 

t r LoriLy. Applications (No Type Date): JP 2000294551 A 20000927 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20020038315 Al 14 G06F-017/30 

-j-p- ^00 21-08-6 8-i-'-a -re-Goe-F-tmvoo 

Abstract (Basic): US 20020038315 Al 

NOVELTY - A matching unit matches data items of a number of master 
tables (108,109) using particular data in the master tables as a 
joining key to join each other. A replicating unit replicates the 
■notched data items based on the joining key to generate one replica 
table (121). 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
data replication program. 

USE - For replicating data of several master tables of database 
in employee management system. 

ADVANTAGE - The data of number of master tables is replicated in 
one replica table , and hence job execution time is minimized. 



DESCRIPTION OF DRAWING (S ) - 
data replica system. 

Master tables £108,109) 
Replica table (121) 
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Elbow rest has joint that is fitted to base, fixing tool that fixes joint 
to base, support that protrudes in front of base when joint is fixed to 
base, and elbow accepting unit that is attached to support 

Patent Assignee: HOGURA Y (HOGU-I) 

Number of Countries: 001 Number of Patents: 001- 

Patent Family: 

Pc'-ent No Kind Date Applicat No Kind Date Week 

•i ?::((205] 862 A 20020219 JP 2000245062 A 20000811 200232 B 

Priority Applications (No Type Date) : JP 2000245062 A 20000811 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 2002051862 A 8 A47B-096/18 

Abstract (Basic): JP 2002051862 A 

NOVELTY - A fixing tool (3) fixes a joint (2) that is fitted to a 
base (1) such as a desk or table . A support articulated by the 
joint protrudes in front of the base when the joint is fixed to the 
base. An elbow accepting unit (5) is attached to the support. 

"SE - For e.g. desk, table . 

A: VANTAGE - Simplifies attachment to or detachment from base. 
: * Mricjnt co be adjusted. Enables work bench to be reduced in 

DESCRIPTION OF DRAWING (S) - The figure is the perspective diagram 
r;L the elbow rest. 
Base (1) 
Joint (2) 
Fixing tool (3) 
Elbow accepting unit (5) 
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Relational database systems, using equijoin operations giving a merge 



join process that creates sorted set of outer table rows that satisfy 

selection criteria and enable parallel execution 
Patent Assignee: UNISYS CORP (BURS ) 
Inventor: LIU L H 

Number of Countries: 001 Number of Patents: 001 
Potent Family: 

:-iU n" No Kind Date Applicat No Kind Date Week 
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US 6185557 Bl 13 G06F-017/00 

Abstract (Basic): US 6185557 Bl 

NOVELTY - When performing equijoin operations on two tables , 
as long as outer table join column value is less than or equal 
to last key value, the same data page from inner table will be 
searched repeatedly. Therefore inner table index records will not be 
revisited until outer table join column is greater than last data 
page key value. Combination of next key and last key allows merge join 
process to determine that entire ranges of outer rows do not have 
r>\c: Lching inner rows . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) A computer readable medium having computer executable modules 
for joining data rows from two tables . 

(2) A merge join process for joining data rows from two tables 
which have a common data column. 

USE - Relational database systems 

ADVANTAGE - Designed to minimize processor time and file input and 
output when performing equijoin operations on two tables . Reduces or 
eliminates searches of the inner index records and the data pages. 
Reduces cache thrashing on the inner index records and so the required 
index record is likely to be in the cache when needed. Minimizing the 
"raversal of the index records and data pages on mass storage minimizes 
* he number of operations performed, and therefore provides a more 
efficient search process. Also , because the merge join process is 
structurally suited for execution on the multi-processor computers, the 
speed of the database queries can be increased through parallel 
processing . 

DESCRIPTION OF DRAWING (S) - Merge join process logic flow diagram, 
pp; 13 DwgNo 0/4 
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Database outputs required search data by joining data retrieved from 
tables specified by divided main keys 
Patent Assignee: SONY CORP (SONY ) 
Numbs r of Countries: 001 Number of Patents: 001 
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Ai'SLr-JCt {Basic); JP 2001005815 A 

NOVELTY - The main key corresponding to each data is divided 
according to components of the data such that each divided main key 
specifies the table ( 120A-120C).. In order to search data, the main 
key input by operator is divided and data in each table specified by 
the divided main key is joined and output as the required result. 
USE - In database for searching data from tables . 
ADVANTAGE - The search time can be shortened by using the divided 
main key instead of unitary main key for searching purpose. Due to the 
reduction of amount of data and shortening of search time, the workload 
of. maintenance operation can also be reduced. 

DESCRIPTION OF DRAWING (S) - The figure shows explanatory drawing of 
database . 

Table (120A-120C) 
. ? y; V DwcjNo 1/3 
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Incremental refresh performing method for materialized view in database 
management systems, involves deleting rows, that are attained by 
combining specific rows with changed rows of selected table 

Patent Assignee: ORACLE CORP (ORAC-N) 

Inventor: DIAS K; WITKOWSKI A 

Number of Countries: 001 Number of Patents: 001 
Patent - Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6125360 A 20000926 US 98109115 A 19980702 200107 B 

;•: t\-y Applications (No Type Date): US 98109115 A 19980702 

'. V' i [ ! s : 

: SV. Kind Lan Pg Main IPC Filing Notes 
• : ..: ,: -:b0 A 24 G06F-017/30 

Acscracc (Basic): US 6125360 A 

NOVELTY - The base table of materialized view is established as 
selected table . The row (Tj) which is processed, is combined with 
both changed and unchanged rows of selected table . The rows that are 
attached due to combining of changed rows of selected ' table with Tj 
in materialized view is deleted, and the rows in which Tj combines with 
unchanged rows of selected table is left. 

DETAILED DESCRIPTION - The base table of a materialized view is 

aata-b.lis-h-ed -as- -a- -selected- —table -r—I-f -the- selected" "table ~is~~£Tfe ' 

right table of outer join , the materialized view is updated to 
reflect deletions to selected table by processing each row (Tj), that 
combines with changed row of selected table . If Tj combines in 
materialized view with changed rows of selected table , then the rows 
containing Tj in materialized view is removed and replaced with a row 
in which selected columns are set to NULL. INDEPENDENT CLAIMS are also 
included for the following: 

(a) database system; 

(b) incremental refresh program 

USE - For maintenance of materialized views that contain one-to-N 
lossless joins in database management system. 

ADVANTAGE - Since the incremental refresh technique does not 



require information about the order of updates to the base tables , 
the overhead associated with maintaining sequencing information is 
avoided. Since the technique is idempotent in performing an incremental 
refresh N-times on the same data, after a system crash, the incremental 
refresh operation is restarted from the beginning without taking into 
account how far the operation had progressed prior to the crash. 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart that 
1 lust rates the incremental refresh operation. 
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Computer aided designing /computer aided machine conversion system e.g. 
for PCB fabrication, converts CAD system data to source table , inorder 
to link, data edition number with comprehensive edition number 

Patent Assignee: FUJITSU LTD (FUIT ) 

Number, of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 2000263354 A 20000926 JP 9974476 A 19990318 200102 B 
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t (Basic): JP 2000263354 A 

NOVELTY - Data of each CAD system contains intrinsic format and 
c'aca edition number with format common to a CAD system. A converter 
converts CAD system data to source table comprising information 
regarding board, mounting, node, inorder to link data editing number 
with comprehensive edition number which are managed by a management 
unit . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) CAD/CAM conversion procedure; 

(b) recording medium with data conversion program 

USE - E.g. for printed circuit board (PCB) fabrication. 
ADVANTAGE - Automatic conversion is performed at mounting angle 
joined on output destination line as table maintains mounting 
angle information at both sides rotations for each component. Since 
data format of different CAD system is converted into an unification 
data format, each CAD data is managed with identical database . 

DESCR-LPT-I ON- OF -DRAWING-* S-)- — The -f igure- shows-fire "di-a-gram 

illustrating components of CAD/CAM conversion system. 
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Access management method for shared resource in multiprocessing system, 
involves choosing subject processor to operate in DUAL mode or SOLO mode 
in response to information exchange attempt being success or failure 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 

Inventor: FREITAS R F; JADAV D; KENCHAMMANA-HOSEKOTE D; MENON J M; STRONG H 
R 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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M: ,r ,f)99 A 22 G06F-012/00 

Abstract (Basic) : US 6105099 A 

NOVELTY - Each of the processors (102,104) sends self-state message 
repeatedly to other processor. The subject processor chooses operating 
mode in response to preset power-up conditions. Any information 
required to synchronize processor's lock tables is exchanged 
suitably during join operation. If attempted information exchange 
succeeds, subject processor is chosen to operate in DUAL mode, else in 
SOLO mode. 

DETAILED DESCRIPTION - The subject processor establishes message 
• rr.riin as birch cry message when lock tables (108,110) of subject 
processor requires information from other processor's lock table . The 
processor is operated in DUAL mode or SOLO mode only after completion 
ol join operation. When the processor in DUAL mode, the message content 
is established by processor as heart beat message. The processor 
combines with other processor to establish single processor locks on 
sub-parts of shared resources (106) to satisfy host requests to access 
the shared resource comprising digital data storage. When failure is 
indicated from other processor, the subject processor is operated in 
SOLO mode during which message content is established as death knell 
message. The processor attempts to acquire sub-part of shared resource 
and to store a predetermined code at each sub-part. The access to 
shared resource is enabled without consulting the other processor only 
: y . = rement of sub-pares of shared resource. INDEPENDENT CLAIMS are 
:;:S;:ded for the following: 

. w access management program; 

digital data processing system 

t'SE - For managing access to shared resources in multiprocessing 
systems used in data processing applications such as automated teller 
network, airline reservation systems, stock brokerage, etc. 

ADVANTAGE - Makes better use of high through-put shared resource by 
efficiently sharing the resources among hierarchically superior hosts. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
computer system with multiple processors. 
Processors— ( 10-2-,- 1-0-4- ) 

Shared resources (106) 

Lock tables (108,110) 
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Duplicate tuples elimination method in database management system 

Patent Assignee: SYBASE INC (SYBA-N) 
Inventor: HI LLEGAS R 
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Abstract (Basic) : US 5937401 A 

NOVELTY - The query for generating the tuple stream satisfying the 
selection criteria is executed, by scanning the selected database 
tables (250) according to the determined join order. The inner most 
table is scanned, thereby executing the filter which filters the 
explicate tuples from the tuple stream. 

DETAILED DESCRIPTION - A received query specifies the selection 
criteria for the information of interest from the database system. 
The determined join order indicates the innermost and outermost 
tables of the selected join so as to guarantee that the tuples 
will stream in order during scanning of the query. A filter is 
initialized at the outermost table for key columns to pass the 
initial tuple encountered from which an initial key is constructed. On 
execution the tuples having keys already encountered in the tuple 
stream are discarded by the filter attached to the innermost table . 

USE - For eliminating duplicate tuples in a generated tuple stream 
in a database management system. 

ADVANTAGE - The duplicate tuples are eliminated from the tuple 
!-»-ir. -without the need for performing an expensive sort operation by 
: : vlri.sr:: Lbed method. 

DESCRIPTION OF DRAWING (S) - The figure is a block diagram of a 
1 : ent-server system with the duplicate tuples elimination method. 
Database tables (250) 
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Data processing system for accessing database - has catalogue analysis 
module which selects table within database , to create ID key for each 
set of columns in table identified as unique set 

Patent Assignee: ACTUATE SOFTWARE INC (ACTU-N) 

Inventor: YOUNG C 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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Absir-a--c (Basic): US 5864856 A 

NOVELTY - A dictionary builder (50) includes catalogue analysis 
:--i:\ t ;^ it2) 'chat selects table within the database , and creates ID 
r>*y cor each set of one or more columns in the table identified as 
a unique set. A join key referencing selected table is created in 
any other table in the database having a set of columns with column 
names matching with that of ID key. DETAILED DESCRIPTION - A user 
interface (114) is coupled to a CPU (110). A dictionary buffer which is 
also coupled to CPU scans the database (130) to create dictionary 
containing attributes that define relationship between tables in the 
database and couples dictionary to the database . The table 
includes columns having column names. INDEPENDENT CLAIMS are also 
included for the following: information accessing method from database 
; dictionary builder. 

USE - For accessing information from relational databases . 

ADVANTAGE - Avoids need for extensive programming by system 
administrator to access database interactively. DESCRIPTION OF 
DRAWING (S) - The figure shows the block diagram of data processing 
system. (50) Dictionary builder; (52) Catalogue analysis module; (110) 
CPU; (114) User interface; (130) Database . 
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Title Terms: DATA; PROCESS; SYSTEM; ACCESS; DATABASE ; CATALOGUE; ANALYSE ; 

MODULE; SELECT; TABLE ; DATABASE ; ID; KEY; SET; COLUMN; TABLE ; 

IDENTIFY; UNIQUE; SET 
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Joining method in database for one or more input tables comprising 
records stored in storage medium * using join index and minimizing number 
of input/ output operations while maximizing use of small main memory 
through buffer allocation process based on join index entries 

Patent Assignee: UNIV COLUMBIA NEW YORK (UYCO ) 

Inventor: LEI H; ROSS K A 

Number of Countries: 021 Number of Patents: 002 
f-.jr^nc Family: 

Patent No Kind Date Applicat No Kind Date Week 
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' > -.98J215 A 19991109 US 97853108 A 19970508 199954 

Priority Applications (No Type Date): US 97853108 A 19970508 
Patent Details: 
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Designated States (National): CA JP 

Designated States (Regional): AT BE CH CY DE DK ES FI FR GB GR IE IT LU 
MC NL PT SE 

^3215 A G06F-017/30 

AIM r«a (Basic): WO 9850867 A 

The method involves allocating an array of partitions in a memory. 
Next a join index is read indicative of records to be joined. The join 
index entries are stored corresponding to each of one or more input 
tables of stored records and a corresponding partition identifier to 
temporary files associated with the allocated partitions. The index 
entries and the corresponding partition identifier in each of the 



temporary files are read in turn and each temporary file is sorted, 
forcions of the tables are sequentially read only if the portion 
includes a record identified in the sorted temporary file. 

The read records are written in accordance with an order of the 
corresponding partition identifiers to separate output files associated 
with each input table . The allocation step equally distributes the 
records of the input table among the array of partitions. 

ADVANTAGE - Join technique performs join operation irrespective of 
join index order. Processes all input tables simultaneously . Can 
perform self- join operation without reading input table multiple 
times . 
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Aid preparation method for recognising database statement with data 
structure in memory device of computer system - involves determining 
ratio of number of distinct rows satisfying join statement in detail 
table and master table for each directional link associated with 
detail and master table 

Patent Assignee: ORACLE CORP (ORAC-N) 

Inventor: TOW D S 

Number of Countries: 001 Number of Patents: 001 
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The method involves defining a set of nodes representing tables . 
A set of directional links are defined between pairs of nodes, each of 
which representing a master detail relationship between a detailed 
table and corresponding master table . A data structure referred to 
as a join free comprising representation of nodes and their 
directional links is defined in a memory device. A set of zeros or more 
selectivity factors are represented for each node. 

Each selectivity factor indicates expected fraction of rows in 
table represented by node that satisfies one or more logical 

_r-pr.ri i f i cms -Set _£o rt-h- i-n -t-h e- da t a- acc es s- s t a-t erne nt\— T he~r a t io of" ~ri umbe r~~ 

cMst inct rows satisfying join statement in detail table to number 
o.t distinct rows satisfying join statement in master table is 
decermined for each directional link associated with detail table and 
master table . The probability of row in the detail table 
corresponds to that of rows in master table . 

ADVANTAGE - Improves statement execution efficiency. Produces truly 
optimise statements. 

Dwg.7/7 

Title Terms: AID; PREPARATION; METHOD; RECOGNISE; DATABASE ; STATEMENT; 
DATA; STRUCTURE; MEMORY; DEVICE; COMPUTER; SYSTEM; DETERMINE; RATIO; 
NUMBER; DISTINCT; ROW; SATISFY; JOIN; STATEMENT; DETAIL; TABLE ; MASTER; 
TABLE ; DIRECTION; LINK; ASSOCIATE; DETAIL; MASTER; TABLE 



Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 
File Segment: EPI 



5/5/20 (Item 17 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 
(c) 2004 Thomson Derwent. All rts. reserv. 

• • 1 '; r = : -. ** * Image available** 

W: : A ■ No: 1998-322214/199828 

•:m :•: a.-u no: NSS-252044 

Query optimisation method for relation database - involves finding JOIN 
conditions to form chain to form tables according to graph join theory 
and reordering in FROM clause 
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The query optimisation method involves examining the WHERE clause 
and finding all join conditions that are present. A chain of the join 
conditions is formed . 

A list of tables is established where i) tables that are in the 
WHERE clause but not in the join conditions are listed first; ii) 
tables in the join chains formed according to graph theory are listed 
next in the same order as they appear in the join chain such that 
the tables from the longest chains are listed before the tables 
; : oiu shorter chains and all tables at a given distance from a root 
table ol* the chain, occur together before the next level in the join 
: : --!ir.. Trie tables are reordered in the FROM clause in the list order. 

ADVANTAGE - Quickly finds access plan. Orders tables in FROM 
clause according to optimal join order in WHERE clause determined by 
graph theory. 
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GUI data access method for relational database - involves joining 
cables on common field using operator and defining logical schema 
defining table hierarchy 

Patent Assignee: INFORMIX SOFTWARE INC (INFO-N) 

Inventor: JACKSON B D; MALONEY C W; MAYFIELD K B; MILLS M A; TRACY K A 
:virn:M*r of Countries: 001 Number of Patents: 001 

: "!:\ Family : 

; r tv.-. No Kind Date Applicat No Kind Date Week 

V/01453 A 19971223 US 9386001 A 19930701 199806 B 



Priority Applications (No Type Date): US 9386001 A 19930701 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 5701453 A 23 G06F-017/30 

Ar>p; r*cr. (Basic): US 5701453 A 

The data access method involves specifying at least one logical 
L-elacionship by selecting a pair of tables having a common field from 
among the relational database tables . The common field of the pair 
of tables is selected as a join field . A join operator is 
selected for the join field to define a join between the pair of 
tables , the logical relationship being one of One-to-One, 
One-to-Zero-or-One, One-to-Zero-or-More, and One-to-One-or-More . 

A logical schema is created def.ining a multilevel hierarchy of the 
tables in the specified logical relationships. A first table and a 
second table . of a pair of tables in a logical relationship are at 
an equal hierarchy level if the logical relationship is one of 
One-to-One and One-to-Zero-or-One . The first table is at a higher 
hierarchy level than the second table if the logical relationship is 
one of One-to-Zero-or-More One-to-One-or-More. 

ADVANTAGE - Avoids user having to understand physical database 
schema to access required data. 
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Computer data base and retrieval method - analysing join statements 
using graphical technique to determine groups of tables represented as 
nodes 
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Abstract (Basic) : EP 476810 A 

Data is retrieved using a request including a set of joint 
statements, each linking the name of two tables . The data processor 
includes an element to assign priority to one table name in ech joint 
statement. An array of graph identifiers corresponding to the table 



ririines is formed. Each group identifier is initialised to have a value 
representing the coressponding table name. Each joint statement is 
processed in succession. 

The value of the priority name is substituted in the array of 
graph identifier values in place of each value representing the other 
name to derive an indication of the number of groups of linked names in 
the set of entered join statements. 

USE /ADVANTAGE - Assures coherency of join list without need for 
re-entering all of elements of join list 
Title Terms: COMPUTER; DATA; BASE; RETRIEVAL; METHOD; ANALYSE; JOIN; 
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Joining selected data in tables of relational data base system - 
defining first parameter, selecting data in first table , placing in 
sub- table , defining second parameter and selecting second data 
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Abstract (Basic): EP 442684 A 

The method for joining selected data in two tables (12,10) in a 
relational data base system which involves defining a first 
parameter (35), selecting data in the first table which satisfies the 
first parameter and placing such selected data in order in a first sub- 
table (38). A second parameter is defined and data is selected in the 
second table which satisfies the second parameter. 

Selecting data which satisfies the second parameter involves 

defining, a- pre-l-imi-nar-y -parameter— based- on-the— data ~in~th"e~fi"rst~ sub" — 

table . Data in the second table is selected which satisfies the 
preliminary parameter and placed in order in a second sub table (45). 
Data in the second sub table which satisfies the second parameter is 
selected and combined with the data in the first sub table so as to 
join the selected data in the first and second tables (50) . 

ADVANTAGE - Highly efficient I/O operations. (14pp Dwg.No.5/5 
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The join operation is performed in three stages with an optional 
fourth stage, the details of which depend on the underlying join 
algorithm used. The first stage is a preparatory stage that provides 
data for the second stage to use as a basis for defining the subtasks 
for the final join operation and to allocate subtasks evenly to 
different processors, even in the presence of data skew. 

Once the second stage has completed its processing, the subtasks 
are shipped to their assigned processors for processing and the final 
join of the two relations in the third stage. Optionally, during the 
join in the third stage, there could be a dynamic reassignment of the 
subtasks should the join operation become unbalanced. 
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Processing data base of system having memory unit - storing tables 

with columns and rows of data 
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A set of sorted columns of one table stored in a DB machine 
is binary-searched by a hardware in synchronism with data transfer 
when another table is read from a disk unit to select a row to be 
joined . 

Where a key word sequence to be searched is stored in ascending or 
descending order, determination of a max. address in an area in which 
key words smaller than a search key are stored and determination of a 
max. address in an area in which the key words smaller than or equal to 
the search key are stored are in parallel executed to determine the 
storage range in one search. 

ADVANTAGE - Increases speed of join operation in relational data 
base machine . 
Dwg.1/23 
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ABSTRACT 

PURPOSE: To greatly shorten join processing time by dividing a join 
processing into plural CPU for parallel execution. 

CONSTITUTION: The 1st table for execution of joins is decentralized to 
! .*■■--,! memories 7-10 and the 2nd table for joins is superposed on those 
•;-;•* s --10 respectively. Here processors 3-6 perform the 1:1 comparison 
:* the partial records of the 1st and 2nd tables and store 

toss i.ve.1 y the pairs of records satisfying the connecting conditions into 
prescribed areas of a common memory device 1. Thus a low table 
undergone the join processing is finally produced on the device L As a 
result, the joint processing is possible in a short period without carrying 
out any sorting process in case one of both tables is especially large or 
small . 



14/5/5 (Item 5 from file: 347) 

DIALOG (R) File 347: JAPIO 

(c) 2004 JPO & JAPIO. All rts. reserv. 



!U229045 **Image available** 
SIMULTANEOUS JOINING PROCESSING SYSTEM FOR PLURAL ITEMS 

PU3. NO. : 58-166445 [JP 58166445 A] 

PUBLISHED: October 01, 1983 (19831001) 
INVENTOR(s): OHARA TOSHISAKU 

APPLICANT (s) : FUJITSU LTD [000522] (A Japanese Company or Corporation), JP 
(Japan) 

APPL. NO. : 57-049456 (JP 8249456] 
FILED: March 27, 1982 (19820327) 

INTL CLASS : [3] G06F-007/28 

■ ! APTO CLASS: 45.1 (INFORMATION PROCESSING — Arithmetic Sequence Units); 
45.2 (INFORMATION PROCESSING — Memory Units) 
Section: P, Section No. 246, Vol. 08, No. 1, Pg. 97, January 
06, 1934 (19840106) 



ABSTRACT 

PURPOSE: To make a line-by-line decision between acceptance and rejection 
through <=2-times matching, by utilizing the previous sequencing of data in 
a table and to exclude lines after the decision from the matching and 
improve processing efficiency. 

CONSTITUTION: When an inquiry input for joint processing is received from a 
display terminal 5, relative tables are loaded from a data file 3 to a 
memory 2. Pointers A and B are initialized to indicate the uppermost lines 
of the tables . An arithmetic processing part 10 performs the matching 
between the tables A and B on the basis of retrieval conditions of shown 
elements. A pointer control part 11 increases the pointer A or B by one 



every time the matching is carried out once to indicate the next line. 
Records of a coincident line are put together in output format by an 
input/output processing part 12 and transferred to the terminal 5. 
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Abstract (Basic): US 20030055830 Al 

NOVELTY - A table join grid row having potential valid table 
join for two database tables represented in a join overview 
diagram, is selected from a grid. A line is formed between the icons 
representing the selected tables in the diagram, upon accepting the 
selected join grid row. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
r m [ : owing : 

;i; program storage device storing selected valid table joins 
: : s c l a y program; a nd 

(2) selected valid table joins display system. 

USE - For displaying selected table joins in relational database 

management system (RDBMS) of computer. 

ADVANTAGE - The join grid and the join overview diagram are 
always displayed together and are synchronized . Hence, if a user 
chooses a join in the join grid, the join overview diagram 
automatically reflects the selection by drawing the line, which 
highlights the associated row in the join grid. Each row in the join 
grid automatically provides a summary of all valid two-column 
combinacions of the selected two tables . Thus, the user need not 
s* ■•!■•»':!: each of the two columns in the join grid separately. Provides 

: l ieien t table join process even for a large number of tables and 
columns . 

DESCRIPTION OF DRAWING (S) - The figure shows a flowchart explaining 
table joins display and selection process, 
pp; 11 DwgNo 5/5 
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Method for managing advertisement information 
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Patent Details: 
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KR 2001107227 A 1 G06F-017/60 

Abstract (Basic): KR 2001107227 A 

NOVELTY - A method for managing advertisement information is 
provided to enable a client to get profit due to advertisement 
information, and to enable an advertiser to get more efficient effect 
oi advertisement with less cost by using quiz game. 

DETAILED DESCRIPTION -(a service operator interface part (301) 
interfaces communication with a computer of a service operator (100) . An 
information classifying part (302) classifies advertisement information 
provided from an advertiser, quiz information provided from the service 
operator ( 100) , member information inputted from a client, an input 
information inputted correspondingly to the quiz by the client and 
information about right answerer respectively. Then, the information 
classifying part (302) stores the classified information in 
corresponding database (303-307) respectively. If a client joins as 
member, a part of requesting input of member information (308 ) requests 
" S° client to input various personal information including ID and 
;*■*■" a* *• r r~« > Al the same time , the part of requesting input of member 
. * fiK: l i or i ( 308 ) requests the client who joined as member to input 
* :r* It) and the password. An external interface part(309) interfaces an 
■gracing server(300) with communication network(500) . When a client 
registered as member connects to the operator server (300) and joins in 
quiz, a quiz information displaying part (310) extracts quiz information 
stored in a quiz information database (303), and provides the 
extracted quiz information to the client, 
pp; 1 DwgNo 1/10 
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Data retrieval optimization method for relational database management 
system, involves creating aggregate join index by combining commonly used 
columns of tables with results from computed aggregation expressions 
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Abstract (Basic): EP 1164509 A2 

NOVELTY - The aggregation expressions from the columns of tables 
are computed. An aggregate join index is created for the tables by 
combining commonly used columns of the tables with results from the 
expressions . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following 

(a) Computerized database management system; 

(b) Data structure; 

(c) Data retrieval optimization program; 

(d) Computer readable storage medium storing data retrieval 
optimization program 

USE - For mainframe, mini computer, personal computer for use in 
relational database management system (claimed) . 

ADVANTAGE - Allows users to perform aggregation operations reliably 
using the aggregate join index. Enhances parallel processing across 
multiple access module processors. 

DESCRIPTION OF DRAWING (S) - The figure shows the hardware and 
software environment. 

pp; 12 DwgNo 1/4 
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Exploitation of db2 universal database design rules in graphical 

representations 
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Abs; race (Basic) : RD 439106 A 

NOVELTY - Relational and physical database design is driven by a 
database designer's heuristic knowledge of database design. The 
present invention utilizes a set of DB2 designer rules derived for DB2 

Unij/ersa_l _Database (JJ.DB4 -Parallel— Ed-i-t-i-on— and- BB-2- UDB - physical 

database ob j ect s . 

DETAILED DESCRIPTION - These rules are useful to designers in the 
performance of various tasks in physical design, such as the creation 
of indexes to optimize the access path for minimal costs; the 
assignment of tables to appropriate tablespaces; or in the parallel 

database environment, table collocation, which will occur if two 
large sized tables aree involved in frequent joins . Each design 
advice action is connected to a set of heuristic rules. Through the 
graphical representation of DB2 UDB design rules users can access the 
sot of rules and tailor the set or individual rules to meet the need of 
j heir database applications. Examples of rule sets include activate 
:,:!.-^s, deactivate rules and modify rules. A set of heuristic rules 



takes the provided data load and workload values when it generates a 
design proposal. If no information is available, the heuristic rule set 
will use predefined default values with its rulse. The following figure 
shows Creation of Index Rules using graphical representations. The 
advice generated using the rule set is given in the form of a report 
that lists the design steps to consider. When design proposals are 
requested, one can either accept or reject a designn proposal. The 
io.ilowing cwo figures show proposed actions and a proposed report for a 
table object using graphical representations . 

USE - None given. 

pp; 2 DwgNo 0/0 
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Edit log management system of engineering drawing, stores drawing joining 
management and authority registration tables based on whose contents 
joining of drawing data and editing authority are managed respectively 

Patent Assignee: KENSETSUSHO KENCHIKU KENYUSHOCHO (KENS-N) ; SHIMIZU CONSTR 
CO LTD (SHMC ) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 
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Abstract (Basic) : JP 2000298678 A 

NOVELTY - The data processor (1) produces 2D engineering drawing 
which is split in tree form and authority of editing are managed based 
on contents registered in drawing joining management table (28) and 
authority registration table (25) . The editing of drawing is 
performed based on parent drawing attribute data (22) and contents of 
updating registration table (24) . 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 

USE - For parallel editing and log management of engineering 
drawings produced during large scale construction project. 

ADVANTAGE - Since editing of drawing data corresponding to parent 
attribute data and updated data is performed and managed based on 
respective tables , editing is performed simultaneously and 
parallely by several operators, enabling drawing joining 
centralization unit to be set to the desire of user and preventing 
mi s t a ke s_o.f _ ed i ti ng. sa me -d r a-w i-n g— b y - s ever a 1 -ope rat o r st " 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
edit log management system. 

Data processor (1) 

Parent drawing attribute data (22) 

H'-.i i st ran ion tables (24,25) 

' ; :wiri:.j joining management table (28) 
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Query processing method for relational database management system, 
involves performing join operation of data streams locally, after 
detecting appropriate conjunct predicates 

Patent Assignee: INT BUSINESS MACHINES CORP (I BMC ) 

Inventor: LOHMAN G M; PIRAHESH M H; SHEKITA E J; SIMMEN D E; URATA M S 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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W'.asic) : US 6112198 A 

NOVELTY - Two partitioned data streams that relate to join 
•.peracion, are received. Usage possibility of conjunct predicates to 
locally perform a parallel inner join or outer join operation, is 
determined. The join operation is performed locally, if the detection 
result approves the usage of appropriate conjunct predicates. 

USE - For parallel query processing in SQL processing relational 
database managements system (RDBMS) in computer network. 

ADVANTAGE - Enables to optimize or avoid data repartitioning by 
recognizing the possible partitioning requirements for achieving 
parallelism for a query operation, and when the partitioning property 
of data satisfies the partitioning requirements of query operation. 

DESCRIPTION OF DRAWING ( S) - The figure represents the query 
execution plan for directed join operation in DBMS. 
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Abstract (Basic) : US 5884320 A 

NOVELTY - The join operations are assigned to the processors 
using index structures. The data points are simultaneously 
redistributed and joined in the processors in parallel based on 
predetermined joining conditions. 

DETAILED DESCRIPTION - The data points among the processors are 
partitioned and an index structure for these data points is created. 
The index structure has several leaf nodes corresponding to a subset of 
;:<JLa points. INDEPENDENT CLAIMS are also included for the following; 

(a) computer product; 

(b) database system. 

USE - In microprocessor system. 

ADVANTAGE - Builds index structure and performs joins efficiently 
using minimum amount of storage space due to system's parallelism. 

DESCRIPTION OF DRAWING (S) - The figure shows flowchart of overall 
operational sequence of performing join operations on data points of 
database in parallel . 
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Abstract (Basic) : GB 2287807 A 

The system joins distributed data with a join key and produces a 
joined table (100,200). Recording devices, disk drives <4a-4d) store 
the distributed data e.g. employee data (5a-5d) and sales data (6a-6d) . 
Slave-processors (3a-3d) are coupled to the recording devices to 
retrieve the data and output the data. The main processor (1) receives 
rhe data from the slave-processors and produces the joined table . 

Krjch slave-processor checks a join key of the second data, sales 
with a join key of the first data . Based on the checking 
n.-suJt it selects the second data (400a-400d) and outputs the data to 
* !-ie main processor. 



USE/ADVANTAGE - Provides high speed joining , eliminates burden on 
master processor as data distributed and stored in slave processors, 
part of join processing done in parallel . 
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Join processor for relational database - extracts join fields of 
relations from aux. memory and sorts with aux. processors, dividing and 
storing in parallel 

Patent Assignee: MITSUBISHI DENKI KK (MITQ ) * 

Inventor: MINEMURA H; NAKAMURA S 

Number of Countries: 003 Number of Patents: 004 

Patent Family: 

Applicat No Kind Date 
GB 9018947 A 19900830 



Patent No 
r,p, 35798 

*■(• V ; '<4 4 



'•C 

; fib2 



Kind 
A 
A 
B 
A 



Date 
19910313 
19910301 
19930901 
19930921 



GB 9018947 
US 90576202 



A 
A 



19900830 
19900829 



Week 
199111 
199119 
199335 
199339 



B 



-loricy Applications (No Type Date): JP 90151114 A 19900608; JP 89225815 A 
1S390831 



Patent Details: 

Patent No Kind Lan Pg Main IPC 

GB 2235798 B 20 G06F-015/40 

US 5247662 A 13 G06F-007/24 



Filing Notes 



Abstract (Basic) : GB 2235798 A 

A join processor for relational databases includes a main 
processor (1) for processing information. A main memory (2) is 
connected to the main processor to store information. A number of 
auxiliary processors (3) are controlled by the main processor. 

A number of auxiliary memories (4) are each connected to the 
corresp. auxiliary processors. A device enables the auxiliary 
processors to perform in parallel dividing and storing relations of 
relational databases in the auxiliary memories in units of record and 
extracting from the auxiliary memories and sorting join fields of the 
relations to be joined. 

USE - Fast join processor for a large size of relations to be 
joined, e.g., employee records. (21pp Dwg.No.1/6 
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Set Items Description 

51 7315 {JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (7N) (RO- 

W? ? OR TUPLE? ? OR COLUMN? ? OR ■ FIELD? ? OR CELL? ? OR TABLE? 
?) 

52 1422 SI (7N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARAL- 

LEL OR SYNCHRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR 
SAME () TIME OR AS OR WHILE) 
: >'i9024 DATABASE? ? OR DATA ( ) BASE? ? OR TABLE? ? 
M 13343 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (20N) (S- 

IMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARALLEL OR SYNC- 
HRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR SAME ( ) TIM- 
' E) 
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58 84186 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (20N) (W- 

HILE OR AS) 

59 6 S3(50N)S5(50N)S8 

510 31724 6 ROW? ? OR TUPLE? ? OR COLUMN? ? 

511 18855 S10(5N) (REDISTRIBUT? OR DISTRIBUT? OR SHUFFL? OR RESHUFFL? 

OR SHIFT??? OR RESHIFT??? OR MOV??? OR MIX??? OR REORDER??? OR 
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OR RE ( ) (ARRANG? OR ORGANI? OR GROUP??? OR ORDER?)) 
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Eliminating group-by operation in a join plan 
Beseitigen von "group-by "-Befehlen aus einem Verbindungsplan 
Elimination des operations d 1 aggregation "group-by" dans un plan de 
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. SPECIFICATION that such attributes were at some point part of a 
candidate key (since they were originally associated with data elements 
having Va lue ( underscore) 1 that have subsequently been changed to 
Wi lue (underscore) 3 as a result of a join operation). As explained below, 
this change from Value ( underscore) 1 to Value (underscore) 3 was made due to 
the fact that the join may cause duplicate values to occur in the 
result, thereby rendering the candidate key to no longer be unique. 
However, after the group by, duplicates are removed and the candidate key 
is again unique for each row. 

If neither condition 512 nor 516 is satisfied, then the optimizer 
module 20 sets ( at 520) the data elements of all attributes (the 
grouping fields) to Value (underscore) 2 . In effect, as part of the 
partial group -by operation on a table T, the grouping fields are set 
to either Value (underscore) 1 or Value (underscore) 2, depending on the 
conditions noted above. 

The optimizer module 20 also updates (at 522) the data structure 401 in 
:-s:^r!st a to a join of two tables (table Tl and... 
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. .SPECIFICATION facilities 1201-4)) by the parsing engine 130 (not shown). 
For example, two columns 210, 220 can be designated as the primary index 
when the table is created. The hash function is then applied to the 
contents of columns 210, 220 for each row. The hash bucket portion of the 
resulting. . . 

..number and the hash function is the sum of remainder when the row number 
is divided by four and the value one, the first eight rows will be 
distributed as shown in Fig. 2. 

Queries involving the values of columns in the primary index can be 
efficiently executed because the processing module llOn) ) having... 

..from row 2 are desired, the parsing engine 130 can apply the hashing 
function to determine that only processing module 1102)) need to be used. 
As another example, an equality join between two tables that have the 
same primary index columns is very efficient. All of the rows that need 
to be joined are found in the same data storage facility 120n) ) and no 
movement of information from rows , between the facilities is necessary. 

While the primary index of a table can be chosen for equality 
joins , for example the order number column of an order table , 
additional design features can make range searches, for example a range 
c; iiates from 'the date column, more efficient. Referring to Fig. 3, a 
: i : ■ : ■ : cined . . . 
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....SPECIFICATION the further Group-by on A of the Group-by on B has an 
identical result of group-by A. Mathematically, if t is a table , then 

Group-by on B (t) (contains subset) Group-by on A(t) . 

Group-by on A (Group-by on B (t)) = Group-by on A(t). 

The Group-by on B groups or sorts all rows using attributes in B 
before an aggregation is performed. Since B is larger than A, the set of 
rows with a ...is the subset of rows with the same constant in A. Hence, 
group-by B is a finer partition of group-by A. 

Given two tables tl and t2, the following proposition (referred to 
as "Proposition 1") is correct: 
(ul x t2) * = ( tl * x t2' ) ' 
;t J >: c2) ' = (tl 1 x tl) • 

This provides flexibility in how the tables are joined to achieve 
■he- final join result. The optimizer module 20 can thus choose among 
plural join paths that involve Groupby tables . 

Given a table t, a Group-by on attribute x is a partition of the rows 
into r.he classes of the same value in x. If the table... 
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... SPECIFICATION array (see symbol 2401 of Fig. 24), the number of record 
ir::nbers starting from what position that are associated with record 
numbers in the master table can be found from the count and start 
position, respectively. To wit, the aforementioned position matches the 
position indicated by the count and the aforementioned. . . 

. . . total and start position looked up in Step 2703, and in the array of 
pointers to the value list within the information block including the 
fields to be displayed in the joined table (view), the pointer 
values indicating the various record numbers are fetched and rearranged 
in order as a new pointer array (Step 2703) . Thereby, among the group 
of information blocks to be joined , those to be derived from the master 
table are complete upon their creation. 

Ir. Fiq. 28, regarding record number "0," "0" is the pointer value to 
:* ■ .'T : t:\sponding position (row 1) within the... 

. . . : -j re ^ei;ched. From these values, one can see that the "2-0=2" record 
':i;rrioers "1" and "3" from the position corresponding to "0" ( row 1) in 
the sorted set ( sort array) are the record numbers of the slave table 
associated to record number "0" of the master table . Accordingly, on 
the slave table side, the pointer values of "0" and "1" are fetched 
sequentially from the positions indicated by the record numbers "1" and 
"3" (row 2 and row 4) within the information block for the "Game date" 
field to be displayed in the table (view), and these are placed as 
new pointers in the value list of the information block for "Ticket/Game 
date" which is an information block for the joined table (view) . Note 
that the value list contained in this information block for "Ticket/Game 
date" is identical to the value list contained in the information... 
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. . .SPECIFICATION of the outer table for rows which satisfy the local 

• :orif!.L t i on ; chen, for each of these rows, a complete join column scan of 
: :,v \ ii:. or Table 10 or a probe via an index on the join column is made 
linn matching records. 
The sort /merge join of the prior art is illustrated in Figure 3 of the 
accompanying drawings. A pre-condition of this technique is that the 
outer and inner tables be ordered based upon their join columns, which 
enables the join to be done similarly to the MERGE phase of a SORT 
routine. Relatedly, a table can be ordered by sorting, or reached 
through an index to provide access in join column sequence. Local 
predicates on each table are applied before a sort, as illustrated in 
Figure 3. Thus, the Table 20 is built from the Table 12 by sorting 
the manager = Davis rows in order by employee number magnitude. 
Similarly, a sorted Table 22 is derived from the skill Table 10 by 
selecting skill table records having "test" in their skill fields and 
ordering those records by employee number. Once join column order is 
imposed on the tables , the join is done very efficiently using the join 

column sorted structures. Thus, the join begins by scanning the join 
column of the Table 22 with the employee number 53, corresponding to the 
first entry in. . . 

...and the scan of the join column of the Table 22 is picked up from where 
the last scan stopped. In this manner, the join columns of the sorted 
Tables 20 and 22 are each scanned only once in effecting the join of the 
tables.. In comparison, the nested loop join procedure requires that... 

. . . table row is found satisfying the outer table local predicate or 
predicates . 

The nested loop join technique makes efficient use of an index on the 
join column of the inner table . The nested loop join technique is 

j'i when the join column values passed -to the inner table are in 
.■■••'^:t?n.:o and the join column index of the inner table is clustered, 
^fiv! when the number of rows retrieved in the inner table by finding 
matching values in the index is small. As is known, an index is 
clustered when the rows of. the table are, for the most part, stored in 
the same physical sequence as the sequence of key values. 

The drawbacks of the nested loop join are... scan on T2 is open, Then 
Close it. 

If scan on intermediate result table is open, then 

The scan on the index of Table 10 is discussed in greater detail 
below. This scan progresses sequentially through the index on the join 
column of the Table 10 to retrieve the... 

...KID's of the Table 10 rows in an index format having the form (key 
V'.^-je:RID list). Since the scan is keying on the join column of 
Table 10, the key value is the value of the join column of an outer 
table row , while the parameter "RID list" is a list which identifies 
table rows which have that key value. 
The procedure 35 in Figure A sorts the rows of Table 12 by the 
local predicate or parameter (manager = "Davis") and orders the 
corresponding records by their join column values. The outcome of the 



:.-rcc»ss is represented by the Table 38. 

Returning co the open cursor process of Flow Chart I, a DO loop is 
opened in step 102 which is continued until the end of file (EOF) 
indicators are encountered in Table 12. In the procedure, for each of 
the rows in the outer table , Table 12, which satisfies the predicates 
or parameters which are local to the outer table, the RID list is 
obtained from the index on the Table ... illustrates the embodiment of the 
invention described above and embodied in Flow Charts I-III. One 
principal precondition of the embodiment is that the outer table be 
ordered by sorting or indexing on the join predicates or table space scan 
with a well-clustered index on the join column. In Figure... 

.is not limited to ordering the outer table by the procedure 35. As is 
known, if an index exists on one or more of the join columns of the ■ 
• -ijitir table , ordering is implicit in the index. 

The technique described also presupposes ordering of the inner table 
by indexing on its join columns. Again, indexing implies ordering and 
supports sequential access to the rows of the table if the RID 1 s are 
ordered. 

Thus, using the rows of the outer Table 12, ordered and sorted by 
the join column , the technique described takes, as an index key 
value, the employee number of the first record, which is 53. This key 
value is used to scan the index of Table 10. In this regard, the scan 
on the index of Table 10 starts at the set (1:9) and scans from there 
down to the set (53:1,11). The list (1,11) is fetched from the set and 
two composite rows are built and placed in the Table 45. When placed in 
•::he table , the rows are ordered, so that the first row is that denoted 
":■■/ -he RID value of 1 and the second by the RID value... 
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. SPECIFICATION data skew by partitioning the join operation into separate 
jobs and optimally scheduling the jobs among a plurality of processors. 

A common operation- in relational database systems is the natural join 
of two relations on respective columns defined over a common domain. See, 
for example, the description of the natural join. . . 

. row is the concatenation of two rows, one from each of the original 
relations, such that both rows have the same value in their respective 
join columns . 

One popular algorithm for computing the join of two relations is the 
sort-merge technique as described by M.Blasgen and K. Eswaran in 
"Storage and Access in Relational Databases", IBM Systems Journal, vol. 
4, pp. 363 et seq. (1977). It can be summarized briefly as follows: 
First, each of the relations is sorted (if necessary) according to the 
join column . Second, the two sorted relations are scanned in the 
obvious interlocked sequence and merged for rows which have equal 
values . 

When sort -merge joins are performed in parallel on a multiple 
processor database system, there exists a problem of data skew that might 
• •: : n ;he join columns of the relations. In general, the issue of 
;:Ol addressed by the join algorithms described in the literature. 
-.»c*rJy article on parallel... 

.runs from disk. The merge tree is mapped to different processors with 
the final merge being sequential. 

In "Join and Semijoin Algorithms for a Multiprocessor Database 
Machine", ACM Trans, on Database Machines, vol. 9, no. 1, March 1984, pp. 
133-161, P. Valduriez and G. Gardarin describe the algorithm generalized 
to. . . 
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Detailed Description 

Detailed Description 

can be a table scan, a full index scan or a partial index scan for 
example . 

For a join statement that joins more than 2 tables , the query optimizer 
chooses which pair of tables is joined first and then which table is 
joined to the result, and so on. The query optimizer then chooses an 
operation to use to 1 5 perform the join operation. 

:rw set, the inner row set is accessed to find all the matching rows 
join . Therefore, in a nested loop join , the inner row set is 
: :*»:>sed as many times as the number of rows in the outer row set. 

;n a sort merge j oin, the two row - sets -being j oined are sorted 
by the j oin keys if they are not already in key order. 

In a hash join, the inner row set is hashed into memory, and a hash 
table is built using the join key, which is the probe key for the join 
operation. Each row from the outer row set is then hashed, and the hash 
table is probed to join all matching rows. If the inner row set is very 
large, then only a portion of it is hashed into memory... the optimizer 
decides 

that the amount of data is large enough to warrant a hash join, or it is 
•.:i:abJe to drive from the outer table to the inner table . The outer 
' .ir.lt? (wiuh preserved rows) is used to build the hash table, and the 
inner cable is used to probe the hash table. 

.-->f" merge j 6ins can be used to join rows from two independent 

sujrces . 

Sort merge joins are useful when the join condition between two tables 
is an inequality condition (but not a nonequality) like <, <--, >, or >=. 
In a merge join , there is no concept of a driving table . This type 
of join operation may optionally be perfon-ned as follows. 

15 1. Sort join operation: Both inputs are sorted on the join key. 

2. Merge join operation: The sorted lists are merged together. 

ii ihe input is already sorted by the j oin column , then a sort j 

oin 

operation is not performed for that row source. 

Sort merge outerjoins are used when an outerjoin cannot drive from the 
outer table to the inner table. 

A 
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: V* tailed Des c r i p t i on 

... .obtain che contents of the change table 223 in the order in which 

;^ f> t't.i l ion originally occurred, a database join operation between the 
■*. i.vie table 223 and the transaction 240 can be used. For example, one 
Implementation of the present invention may use the following SQL join 
operation, in which change table 223 has source table columns 231 
named CI, ...that all SQL statements within a given 

transaction have the same value for a transaction identifier and that 
this value was stored in the transaction table 240 in the same row as 
the commit system change number 243. Therefore a join across the two 
tables matches up all SQL statements belonging to a particular 
transaction with their associated commit system change 

number 24 3. Moreover, the SQL ORDER BY clause returns the change rows 
in 

increasing sorted order, according to their commit system change 
number, which is to say, in the original order in which the transactions 
committed. 

Moreover, the end user or the subscriber application 121 need not be 
aware of the fact that there is a join between the change table (that 
does not contain the commit system change number) , and the transaction 
table {that contains the commit system change number and little else) . 
The feature may be attained by providing a subscriber view 251 , 
generated on behalf of... 
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Detailed Description 

Detailed Description 

for the Parity column, the "FRENCH" value corresponding to the record 
is nhe value in the corresponding cell of the FRENCH column in the value 
table . Tn the example, the entry in row 5 of the Parity column of the 
: MS'.anrje table is associated with the record being reconstructed. Thus, 
i he "French" value is found in row 5 of the "French" column of the value 
table , whose value is "Trois". 

Alternatively, an unsorted column may be included in the data structures 
of the present invention by using the identity permutation as the 
permutation for that column (i.e., the value table for that column 
will not be reordered in any way) . 
1 5 

Column Merge Compression 
In accordance with a farther embodiment of the invention, separate value 

table columns can be merged into a single column, referred to herein 
as a "union column, " with separate displacement list columns for each of 
the original columns. This has the potential advantages of having a 
smaller value table , pre- joined data expediting join operations and 
improved update speed. A value not present in a particular original 
column is indicated in the displacement table column by a null range 
for that value. For example (assuming a "first row number" format 
displacement table ) , if the original column did not have the value at 
rcwYof the merged column, the displacement table for that column would 
have the same value at row Y and row T+ l f (that is Displacement - Table 
( rfl, c) Displacement -Table (r, c) -O) . If Y is the last row in the column, 
its value is set to a number greater... 
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Detailed Description 

. . . 0, "Country" 312, and "Language" 314 columns. 

All of the above Figure 2 and Figure 3 elements discussed are real 
elements . 

They exist inside relational tables managed by the system. The system 
also provides for other types of identifiers, namely virtual elements. 
Virtual elements are identifiers that are not real elements... 
...for example, population", "age", and "religion". The second category of 
virtual element, complex virtual elements, represent information that is 
computed by the system from table join operations or other types of 
r. t r i thrnetic processing . 



example, the "cast list" 914 associated with a movie, such as the 
.novie titled "Rain Man", can be queried like an attribute of a particular 
movie. No such attribute exists as a real element inside any of the 
movie related tables of the illustrative system I 00. The system 1 00, 
can perform a table join of the Movie Information Table 202 and the 
Movie Personnel Table I I 00 to match a movie identifiers in the 
columns 204 and I 1 04 from each table. 

Figure 1 1 is a block diagram depicting a Movie Personnel Information 
Table I I 00. The role identifier column 1 102, lists role identifiers 
'hat identify roles associated with various movies as indicated by the 
movie identifiers residing in the same row and I 0 listed in the 
movie identifier column 1 104. The movie identifier column II 04 
lists identifiers to various movies . The person identifier column 1 1 
06 lists person identifiers of persons associated with movie roles. The 
role type column 1 1 08 lists the types of roles, such... 
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Detailed Description 

... the function is computed using all preceding rows. The first two rows 
in the table or group however will have the NULL value. 

TRANINT Results Table 

This table supports the TRANINT function that derives counts, amounts, 
percentage means, and intensities from a transaction summary file. This 
function takes as parameters a... 

...calculate the average periodic transaction counts and transaction 

:;^':r.'s by customer and transaction type (such as debit or credit) from a 
• r-j.".50:;t'ion summary table. 

!>ai:a Reorganization - Results Tables and Column Definitions 

i'he following describes the results tables and column definitions for 

the Data Reorganization functions. 

JOIN Results Table 
This table supports the JOIN function that joins tables together 
into a 

combined result table . This function takes as parameters a list of 
tables, keys, and column lists to combine new derived variables into an 
analytic data set, prior to building a matrix or. . . 

...be requested, which returns rows for all key column values found in the 
first table specified, and fills in any missing values from the other 
tables with null values. 



DENORM Results Table 

This table supports the DENORM function that selects or creates a new 
denormalized table. This function takes as parameters a table name, the 
name of key column. . . 
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trailed Description 

tor the Parity column, the 
"FRENCH" value corresponding to the record is the value in 
the corresponding cell of the FRENCH column in the value 

table . In the example, the entry in row 5 of the Parity 
column of the instance table is associated with the record 
being reconstructed. Thus, the "French" value is found in 
row 5 of the "French" column of the value table , whose value 
is "Trois". 

Alternatively, an unsorted column may be included 
In rhe data structures of the present invention by using the 
: i^r:- i cy permutation as the permutation for that column 
. . t», , the value table for that column will not be reordered 

, \, :. n any way ) . 

Column Merge Compression 
In accordance with a further embodiment of the 
invention, separate value table columns can be merged into a 
single column, referred to herein as a "union column, " with 
separate displacement list columns for each of the original 
columns, This has the potential advantages of having a 
smaller value table , , pre- joined data expediting join 
operations and improved update speed. A value not present in 
a particular original column is indicated in the displacement 

table column by a null range for that value, For example 
(assuming a "first row number" format displacement table ), if 
the original column did not have the value at row Irl of the 
merged column, the displacement table for that column would 

have che same value at row Irl and row Ir+11 (that is 
Displacement- Table ( r+1, c) -Displacement Table (r, c) =0) , If Irl 
is the last row in the column, its value is set to a number 
- 32. . . 
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TableDef , 

) f lsNumberOrEnglishLetter = tblFrorn As Table, tblTo As Table, 
! i: LsEnglishLetter (Character) Or FieldNotForCompare ( As String) As 
: flsNumber (Character) Integer 
End Function 

Dim FieldCount As Integer 

Function IslnArrayStr (ByVal aStr As Dim FieldNumber As Integer 
String, aArray( As String) As... 
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Claims 

Claim 

they are executed during an initial set-up phase of query execution. 
They fall into two general groups: DDL set-up queries to create temporary 

tables or indexes; and DML set-up queries, which could be used in 
multi-stage execution strategies to populate temporary tables with 
i itt^rmediace results. Potentially, a redundant sorting of the non-driving 
L^ble in the join by each parallel subquery, either by pre-sorting or 
by preindexing the non-driving table. If pre-sorting is used, only those 
rows which satisfy singletable predicates are inserted in a temporary 
table , which is indexed on the join columns , and 
the temporary table replaces the original table in t 

he FROM clauses of the parallel 

subqueries. If pre-indexing ' is used, the entire table must be indexed 
on the join columns . Either way, the resulting table can now be 
used as the inner table in a nested loops join . Any set-up queries 
which are generated as part of the transformation of a given query must 
be executed to completion before proceeding with execution... 

remaining query types, and could concept ua 1 1 y be_p_e.r f o rme d- -in 

^!.I'-J_.wlx.h it-.-- ~~ " ' 

*:«"rj"-up Queries 

c:;r each set-up query which creates a temporary table or index, a 
corresponding clean-up query is required to dispose of that temporary 
object. Clean-up queries are generated at the same time set... 
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~: ; ! 1 l ex u Ava i 1 a bi 1 i t y : 
; L ; ed Description 

■ i . . • • : >-. i s.-T i ption 

... : piresent invention, the smallest constituent parts are row sources. 
i c*w source is an objectoriented mechanism for manipulating rows of data 
l n a relational database 

system (RDBMS) . A row source is implemented as an iterator. Every row 
source has class methods associated with it {e.g., open, fetch next and 
dose) . 

Examples of row sources include: count, filter, join , sort , union, 
and table scan. 

Other row sources can be used without exceeding the scope of the 

present 

i nvenuion . 

As a result of the compilation process, a plan for the execution of a 
query 

is generated. An execution plan is a plan for the execution... 
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Set Items Description 

51 4083 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (7N) (R0- 

W? ? OR TUPLE? ? OR COLUMN? ? OR FIELD? ? OR CELL? ? OR TABLE? 
?) 

52 365 S1{7N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARAL- 

LEL OR SYNCHRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR 
SAME () TIME OR AS OR WHILE) 

53 1221004 DATABASE? ? OR DATA ( ) BASE? ? OR TABLE? ? 

54 4190 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (20N) (S- 

1MULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARALLEL OR SYNC- 
HRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR SAME { ) TIM- 

E) 

324 SI (25N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARA- 
LLEL OR SYNCHRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC - 
OR SAME () TIME OR AS OR WHILE) 

56 95207 REDISTRIBUT? OR RE ( ) DISTRIBUT? 

57 50 S4:S5 AND S6 

58 36 S3 AND S7 

59 21 RD (unique items) 
&lt> : ' -17? S9 NOT PY=2002:2004 

511 83 S2 AND S3 

512 1814 3573 ROW? ? OR TUPLE? ? OR COLUMN? ? OR FIELD? ? OR CELL? ? OR - 

DATA OR INFORMATION 
:V. 3 590701 S12 ( 5N) ( REDISTRIBUT? OR DISTRIBUT? OR SHUFFL? OR RESHUFFL? 

OR -SHIFT-??-? -OR-RESH-I-FT??-?-OR-MOV?-??-OR- MIX*???" OR "REORDER? ??"OR 

SORT??? OR RESORT??? OR REARRANG? OR REORGANI? OR REGROUP??? 
OR RE ( ) (ARRANG? OR ORGANI? OR GROUP??? OR ORDER?)) 

514 . 15 Sll AND S13 

515 10 RD (unique items) 

516 29 S5 AND S3 AND S13 

517 19 RD (unique items) 

518 19 S15 OR S17 

519 . 11 S18 NOT (S10 OR PY-=2002 : 2004 ) 
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Abstract: This paper describes PBSM (Partition Based Spatial-Merge) , a 
new algorithm for performing spatial join operation. This algorithm is 
especially effective when neither of the inputs to the join have an index 
on the joining attribute. Such a situation could arise if both inputs to 
the join are intermediate results in a complex query, or in a parallel 
environment where the inputs must be dynamically redistributed . The PBSM 
algorithm partitions the inputs into manageable chunks, and joins them 
•ising a computational geometry based plane-sweeping technique. This paper 
'it so presents a performance study comparing the traditional indexed nested 
loops join algorithm, a spatial join algorithm based on joining spatial 
indices, and the PBSM algorithm. These comparisons are based on complete 
implementations of these algorithms in Paradise, a database system for 
handling GIS applications. Using real data sets, the performance study 
examines the behavior of these spatial join algorithms in a variety of 
situations, including the cases when both, one, or none of the inputs to 
the join have an suitable index. The study also examines the effect of 
clustering the join inputs on the performance of these join algorithms . The 
performance comparisons demonstrates the feasibility, and applicability of 
the PBSM join algorithm. (Author abstract) Refs. 
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Abstract: The transitive closure operation is an important extension to 
relational algebra. Because of its high computation cost, it is of great 
interest to design efficient parallel algorithms for computing the 
transitive closure in relational database systems. In this paper, we 
present a new algorithm to compute transitive closures on SIMD meshes based 
on relational algebra operations. Double-hash distribution is used to avoid 
rehashing new tuples for the next join phase. There presently exists no 
rr: srep for the redistribution of these tuples . Possible redundant 
.. 't-.'z^n between different join phases has been prevented without 
ilobfil operations. As only regular linear communication occurs on 
• :.- :;i?-.sh, and uhe workload is fully distributed, a speedup of 0(n 

m i ; ;p!ied by n) has been achieved, where n multiplied by n is the size of 
ni.ish. Therefore, this algorithm is an optimal parallel version of the 
transitive closure algorithms based on relational algebra operations on 
SIMD meshes. (Author abstract) 28 Refs. 
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Abstract: An important aspect of database processing in parallel 
computer systems is the use of data parallel algorithms. This paper 
presents several parallel algorithms for the relational database join 
operation in a hypercube multicomputer system. The join algorithms are 
classified as cycling or global partitioning based on the tuple 
.Misi ribuuion method employed. The various algorithms are compared under a 
-emmon framework, using time complexity analysis as well as an 
i niplernen tacion on a 64 node NCUBE hypercube system. In general, the global 
par citroning algorithms demonstrate better speedup. However, the cycling 

algorithm ca_n perform bet_tez_than_the . .global -algorithms- i-n specific 

'situations, viz, when the difference in input relation cardinalities is 
large and the hypercube dimension is small. We also study the usefulness of 
the data redistribution operation in improving the performance of the 
join algorithms, in the presence of uneven data partitions. Our results 
indicate that redistribution significantly decreases the join algorithm 
execution times for unbalanced partitions. (Author abstract) 21 Refs. 
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Author: Baru, Chaitanya K.; Frieder, Ophir 

Corporate Source: Univ of Michigan, Ann Arbor, MI, USA 
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Publication Year: 1989 

CODEN: ITCOB4 ISSN: 0018-9340 

Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
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A::^ rare: Distributed memory architectures, specifically hypercubes, for 
.: i c>\ database processing are treated. The cube interconnects 
•:•:.['.:•• :: -efficient data combination for the various database operations, 
unc: nonuniform data distributions are handled by dynamically 
redistributing data utilizing these interconnections. Selection and scalar 
aggregation operations are easily supported. An algorithm for the join 
operation is discussed in some detail. The cube is compared with another 
multicomputer database machine, SM3, and the performance of the join 
operation in these systems is described. The join performance in a cube is 
comparable to that in SM3 even when the cube is assumed to have a 
nonuniform data distribution. 15 refs. 
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Abstract: Strategies for performing database operation in a 
cube-connected multicomputer system with parallel I/O are presented. The 
cube interconnection subsumes many other structures such as the tree, ring, 
etc. This property is utilized to efficiently support database operations 
such as select, aggregate, join , and project. The strategies presented 
are unique in that they account for the nonuniform distribution of data 
across parallel paths by incorporating data redistribution steps as 



part of the overall algorithm. The two main data redistribution 
operations used are tuple balancing and merging. 17 refs. 
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The research question in this thesis concerns how to parallelize the 
spa : :ia 1 range and join query processing in order to support a high 
pt- c norma nee spatial database application. Data partitioning for the range 
query operation involves declustering of spatial data, while data 
partitioning for the spatial join involves clustering of spatial data. If 
the static partitioning methods fail to equally distribute the load among 
different processors, the load-balance may be improved by redistributing 
parts of the data to idle processors using Dynamic Load-Balancing (DLB) 
techniques . 

In this thesis, we provide a framework for declustering collections of 
extended spatial objects by identifying the following key issues: (i) the 
wor'.-Ioad metric, (ii) the spatial-extent of the work-load, (iii) the 
• ! : y : t ibu t ion of the work-load over the spatial-extent, and (iv) the 
:•■ ..<• • ■;■ method. We identify and experimentally evaluate alternatives 
• • : : i hose issues. 

:r .jHriiuion, we also provide a framework for dynamically balancing the 
i; t"iw^en different processors. We experimentally evaluate the proposed 
» v. aliening and load-balancing methods on a distributed memory MIMD 
machine (Cray T3D) and shared-memory machine (SGI Challenge) . Experimental 
results show that the spatial-extent and the work-load metric are important 
issues in developing a declustering method. Experiments also show that the 
replication of data is usually needed to facilitate dynamic load-balancing, 
as the cost of local processing is often less than the cost of data 
transfer for extended spatial objects. In addition, we also show that the 
effectiveness of dynamic load-balancing techniques can be improved by using 
declustering methods to determine the subsets of spatial objects to be 
transferred during run-time. 

A spatial join is often performed in two steps: a filter step and a 
je_£in£Lm£.n-L s.t_ep_. _In~ this— thesis-,— we -focus -on— the -refinement— step _ of "the 
.special join. The refinement step of the spatial join takes as input a 
sequence of pairs of tuples and checks each tuple to see if the join 
predicate is satisfied for that tuple . This is similar to the join 
index processing done in traditional relational databases . We develop 
min-cut graph partitioning based methods for join processing using a join 
index. We use min-cut graph partitioning as a new heuristic for solving the 
page access sequence problem for fixed size buffer in sequential systems. 
We show that the number of page accesses needed to compute a join using 
join index in a fixed buffer environment is bounded by the sum of sizes of 
fhe base relations and the size of the cut-set of the page connectivity 
jraph. Since the min-cut graph partitioning aims to minimize the size of 
-spp, this proposed heuristic is a direct method. Experiments with 



benchmark data sets show that the graph-partitioning based heuristic 
outperforms the existing methods, particularly when join selectivity is 
high and buffer space is small. (Abstract shortened by UMI . ) 
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Abstract: Known parallel DBMS offer at present only static partitioning 
schemes. Adding a storage node is a cumbersome operation that typically 
requires the manual data redistribution . We present an architecture 
termed AMOS-SDDS for a share-nothing multicomputer. We have coupled a 
high-performance main-memory DBMS AMOS-II and a manager of Scalable 
Distributed Data Structures (SDDS) into a scalable distributed system SDDS 
provides the scalable data partitioning in distributed RAM, supporting 
parallel scans with function shipping. AMOS-SDDS couples both systems using 
- !,f» AMOS-II foreign function interface. Its scalability abolishes the 
:j:r.:*«-: some storage limits of a single site RAM DBMS technology. Its 
ribuced RAM query processing and scalable data partitioning is an 
; improvement over the current parallel DBMSs technology. We validate 
AMOS-SDDS architecture by experiments with distributed nested loop join 
queries over a file scaling up to 300,000 tuples. It includes performance 
study of speed-up and scale-up characteristics. The results encourage the 
use of SDDS for high-performance database systems. (19 Refs) 
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[ h: r:ai ionai Conference, DEXA 2000. Proceedings 
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Abstract: Join is an expensive and frequently used operation whose 
parallelization is highly desirable. However, the effectiveness of 
parallel joins depends on the ability to evenly divide load among 

processors. Data skew can have a disastrous effect on performance. Although 
many skew-handling algorithms have been proposed, they remain generally 
inefficient in the case of multi- joins due to join product skew, costly 
and unnecessary redistribution and communication costs. A parallel 

join algorithm called fa- join was introduced in an earlier paper with 
deterministic and near-perfect balancing properties. Despite its 
advantages, fa-join is sensitive to the correlation of the attribute value 
ir;i!i- ions in both relations. We present an improved version of the 
: ; • r.iu called Sfa-join with a symmetric treatment of both relations. Its 
: .» ::*-r.c:bly low join-product and attribute-value skew makes it suitable for 
: -:.^ J :c: use in multi- join operations. Its performance is analyzed 
*. : t e<*:: ei; ically and experimentally, to confirm its linear speed-up and its 
superiority over fa-join. (16 Refs) 
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Abstract: For complex queries in parallel database systems, substantial 
amount's of data must be redistributed between operators executed on 
different processing nodes. Frequently, such intermediate results cannot be 
held in main memory and must be stored on disk. To limit the ensuing 
performance penalty, a data allocation must be found that supports parallel 
I/O to the greatest possible extent. In this paper, we propose declustering 
even self-contained units of temporary data processed in a single operation 
(such as individual buckets of parallel hash joins ) across multiple 
disks. Using a suitable analytical model, we find that the improvement of 



parallel I/O outweighs the penalty of increased fragmentation. (14 Refs) 
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Abstract: We investigate various load balancing approaches for hash-based 
join techniques popular in multicomputer-based shared-nothing database 
systems. When the tuples are not uniformly distributed among the hash, 
buckets, redistribution of these buckets among the processors is 
necessary to maintain good system performance. Two recent load balancing 
techniques which rely on sampling and incremental balancing, respectively, 
have been shown to be more robust than conventional methods. The comparison 
: hose two approaches, however, has not been investigated. In this study, 
mipove these two schemes and implement them along with a conventional 
and a standard join technique which does not do load balancing on 
m: nCUBE/2 parallel computer to compare their performance. Our 
<?:-:per i.mental results indicate that the sampling technique is the better 
approach. To further evaluate the performance of these techniques under 
diverse hardware conditions, we also develop a cost model and implement a 
simulator to perform sensitivity analyses with respect to various hardware 
parameters. The simulation results show that both sampling and incremental 
techniques provide noticeable savings over conventional methods, with the 
sampling approach being more scalable in supporting very large database 
systems. (26 Refs) 
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Abstract: A multiple-weighted-tree based query optimization method for 
parallel database systems is proposed in this paper. The method consists 
of a multiple-weighted-tree based parallel query plan model, a cost model 
for parallel query plans, and a query optimizer. The parallel query plan 
model is the first one to model all basic relational operations, all three 
types of parallelism of query execution in parallel database systems, 
processor and memory allocation to operations, memory allocation to the 
buffers between operations in pipelines and data redistribution among 
processors. The cost model takes the waiting time of the operations in 
: • ; r.H? ! ining execution into consideration and is computable in bottom-up 
:-:sr:icn. The query optimizer addresses the query optimization problem in 
■ :.-• .-oncexc of Select-Project-Join queries that are widely used in 

•:";:;*■:•.: Lai DBMSs. Several heuristics determining the processor allocation 
operations are derived and used in the query optimizer. The query 
optimizer is aware of memory resources in order to generate good-quality 
plans. It includes the heuristics for determining the memory allocation to 
operations and buffers between operations in pipelines so that the memory 
resource is fully exploit. In addition, multiple algorithms for 
implementing join operations are considered in the query optimizer. The 
query optimizer can make an optimal choice of join algorithm for each 

join operation in* a query. The proposed query optimization method has 
been used in a prototype parallel database management system designed 
imp L erne n ted by the author. (11 Refs) 
..t. ■ f * j y : c 
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Abstract: The transitive closure operation is an important extension to 
relational algebra. Because of its high computational cost, it is of great 
interest to design efficient parallel algorithms for computing the 
transitive closure in relational database systems. In this paper, we 
present a new algorithm to compute transitive closures on SIMD meshes based 



on relational algebra operations. The double-hash distribution is used to 
avoid rehashing new tuples for the next join phase. There presently exists 
no extra step for the redistribution of these tuples . Possible 
redundant computation between different join phases has been prevented 
without using global operations. As only regular linear communication 
occurs on the mesh and the workload is fully distributed, a speedup of 
0(n*n) has been achieved, where n*n is the size of mesh. Therefore, this 
algorithm is an optimal parallel version of the transitive closure 
algorithms based on relational algebra operations on SIMD meshes. (28 
Refs) 
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Abstract: When data is uniformly distributed, the parallel hash-based 
join algorithm scales up well. However, the presence of data skew can 
cause load imbalance among the processors, significantly deteriorating its 
performance. In this paper we propose a dynamic skew handling algorithm 
which deals with this load imbalance, by detecting and handling join 
product skews at run-time. The idea is to monitor the join processing at 
the join phase and compare the average processing rate of each partition 
with the rate statically predicted at the scheduling phase. If their 
difference is detected to be large enough to produce a significant 
performance degradation, the processor is considered to be overloaded and a 
workload compensation strategy is dynamically invoked. In this case, based 
on the measured average process_ing_ rate, _the .amount of -overhead -caused by — 
— c^e- -crnpredicte"d ~joln~pfoduct" skew is calculated and, the amount of load to 
be migrated to the non-overloaded processors is determined. We propose two 
methods-the result redistribution and the processing task migration-to 
handle the load migration from the overloaded processor to the 
non-overloaded processors. Simulation results show that our dynamic skew 
handling approach can detect and handle load imbalances efficiently, so 
that the rebalance of load among the processors results in an almost 
constant join execution time under different join product skews. (11 Refs) 
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Abstract: Parallel database systems have to support both 
inter- t ransact ion as well as intra-transaction parallelism. 

* r.cer- transact ion parallelism (multi-user mode) is required to achieve high 
■ h : -oucihput , in particular for OLTP transactions, and sufficient 

•rsi-erf ectiveness . Intra-transaction parallelism is a prerequisite for 
inducing the response time of complex and data-intensive transactions 
(queries) . In order to achieve both goals dynamic strategies for load 
balancing and scheduling are necessary which take the current system state 
into account for allocating transactions and subqueries to processors and 
for determining the degree of intra-transaction parallelism. We study the 
load balancing problem for parallel join processing in shared nothing 

database systems. In these systems, join processing is typically based 
on a dynamic redistribution of relations to join processors thus making 
dynamic load balancing strategies feasible. In particular, we study the 
performance of dynamic load balancing strategies for determining the number 
of join processors and for selection of the join processors. In 
concrast to previous studies on parallel join processing, we present a 
nulci-user performance analysis for both homogeneous and 
heterogeneous/mixed workloads as well as for different database 

>j I locations . (34 Refs) 
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We investigate various load balancing approaches for hash-based join 
techniques popular in multicomputer-based shared-nothing database 
systems. When the tuples are not uniformly distributed among the hash 
buckets, redistribution of these buckets among the processors is 
necessary to maintain good system performance. Two recent load balancing 
techniques which rely on sampling and incremental balancing, respectively, 
have been shown to be more robust than conventional methods. The comparison 
of these two approaches, however, has not been investigated. In this study, 
we improve these two schemes and implement them along with a conventional 
ir.o'hod and a standard join technique which does not do load balancing on 
.i.-i r,CUB£/2 parallel computer to compare their performance. Our 
■•xper imencal results indicate that the sampling technique is the better 
approach. To further evaluate the performance of these techniques under 
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sampling approach being more scalable in supporting very large database 
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Recently, Haas and Hellerstein proposed the hash ripple join algorithm in the context of 
online aggregation. Although the algorithm rapidly gives a good estimate for many join- 
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satisfy the join predicate is small or if there are many groups in the output. Furthermore, if 
memory overflows (for example, because the user allows the algorithm to run to completion 
for an exact answer), the algorithm degenerates to bl ... 



2 Query evaluation techniques for large databases H 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), volume 25 issue 2 

Full text available: "H pdfiS.37 m?r- Additional Information: Mcftatjon, abstract, references, citings, index 
' ™* ' ' ' " terms, review 

Database management systems will continue to manage large data volumes. Thus, efficient 
algorithms for accessing and manipulating large sets and sequences will be required to 
provide acceptable performance. The advent of object-oriented and extensible database 
systems will not solve this problem. On the contrary, modern data models exacerbate the 
problem: In order to manipulate large sets of complex objects as efficiently as today's 
database systems manipulate simple records, query-processi ... 
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represent the class of hash-based join methods, Simple hash represents a looping algorithm 
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The similarity join is an important operation for mining high-dimensional feature spaces. 
Given two data sets, the similarity join computes all tuples (x, y) that are within a distance 
&egr;.One of the most efficient algorithms for processing similarity-joins is the 
Multidimensional-Spatial Join (MSJ) by Koudas and Sevcik. In our previous work — 
pursued for the two-dimensional case — we found however that MSJ has several 
performance shortcomings in terms of CPU and I/O cost as ... 
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This paper presents the results of an initial performance evaluation of the Gamma database 
machine. In our experiments we measured the effect of relation size and indices on 
response time for selection, join, and aggregation queries, and single-tuple updates. A 
Teradata DBC/1012 database machine of similar size is used as a basis for interpreting the 
results obtained. We also analyze the performance of Gemma relative to the number of 
processors employed and study the impact of varying the m ... 
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A set containment join is a join between set-valued attributes of two relations, whose join 
condition is specified using the subset (c) operator. Set containment joins are deployed in 

many database applications, even those that do not support set-valued attributes. In this 
article, we propose two novel partitioning algorithms, called the Adaptive Pick-and-Sweep 
Join (APS3) and the Adaptive Divide-and-Conquer Join (ADCJ), which allow computing set 
containment joins efficiently. We show that ... 
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Inter-object references are one of the key concepts of object-relational and object-oriented 
database systems. In this work, we investigate alternative techniques to implement inter- 
object references and make the best use of them in query processing, i.e., in evaluating 
functional joins. We will give a comprehensive overview and performance evaluation of all 
known techniques for simple (single-valued) as well as multi-valued functional joins. 
Furthermore, we will describe special order-preser ... 

Keywords: Functional join, Logical OID, Object identifier, Order-preserving join, Physical 
OID, Pointer join, Query processing 



9 ClassincM^ 

Kai-Uwe Sattler, oiiver Dunemann 

October 2001 Proceedings of the tenth international conference on Information and 
knowledge management 

Full text available- M pdff 1.5G MB: Additional Information: MUMum, abstract, .references, citings, Index 
' ^ "'" " ; terms 

Scalable data mining in large databases is one of today's challenges to database 
technologies. Thus, substantial effort is dedicated to a tight coupling of database and data 
mining systems leading to database primitives supporting data mining tasks. In order to 
support a wide range of tasks and to be of general usage these primitives should be rather 
building blocks than implementations of specific algorithms. In this paper, we describe 
primitives for building and applying decision tree classifi ... 
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The join operation is one of the fundamental relational database query operations. It 
facilitates the retrieval of information from two different relations based on a Cartesian 
product of the two relations. The join is one of the most diffidult operations to implement 
efficiently, as no predefined links between relations are required to exist (as they are with 
network and hierarchical systems). The join is the only relational algebra operation that 
allows the combining of related tuples fro ... 
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During the last decade, multimedia databases have become increasingly important in many 
application areas such as medicine, CAD, geography, and molecular biology. An important 
research issue in the field of multimedia databases is the content-based retrieval of similar 
multimedia objects such as images, text, and videos. However, in contrast to searching 
data in a relational database, a content-based retrieval requires the search of similar 
objects as a basic functionality of the database system ... 
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Scheduling query execution plans is an important component of query optimization in 
parallel database systems. The problem is particularly complex in a shared-nothing 
execution environment, where each system node represents a collection of time-shareable 
resources (e.g., CPU(s), disk(s), etc.) and communicates with other nodes only by 
message-passing. Significant research effort has concentrated on only a subset of the 
various forms of intra-query parallelism so that scheduling and synchronizati ... 
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Search operations in databases require special support at the physical level. This is true for 
conventional databases as well as spatial databases, where typical search operations 
include the point query (find all objects that contain a given search point) and the region 
query (find all objects that overlap a given search region). More than ten years of spatial 
database research have resulted in a great variety of multidimensional access methods to 
support ... 
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Since the early days of logic programming, researchers in the field realized the potential for 
exploitation of parallelism present in the execution of logic programs. Their high-level 
nature, the presence of nondeterminism, and their referential transparency, among other 
characteristics, make logic programs interesting candidates for obtaining speedups through 
parallel execution. At the same time, the fact that the typical applications of logic 
programming frequently involve irregular computatio ... 
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Object-oriented and object-relational DBMS support set valued attributes, which are a 
natural and concise way to model complex information. However, there has been limited 
research to-date on the evaluation of query operators that apply on sets. In this paper we 
study the join of two relations on their set-valued attributes. Various join types are 
considered, namely the set containment, set equality, and set overlap joins. We show that 
the inverted file, a powerful index for selection queries, c ... 
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Data sets in large applications are often too massive to fit completely inside the computers 
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memory (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit 
locality in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 
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An overview is presented of object-based and image-based representations of objects by 
their interiors. The representations are distinguished by the manner in which they can be 
used to answer two fundamental queries in database applications: (1) Feature query: given 
an object, determine its constituent cells (i.e., their locations in space). (2) Location query: 
given a cell (i.e., a location in space), determine the identity of the object (or objects) of 
which it is a member as well as the re ... 

Keywords: Access methods, R-trees, feature query, geographic information systems 
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Research aimed at correcting words in text has focused on three progressively more difficult 
problems: (1) nonword error detection; (2) isolated-word error correction; and (3) context- 
dependent work correction. In response to the first problem, efficient pattern-matching and 
n-gram analysis techniques have been developed for detecting strings that do not appear in 
a given word list. In response to the second problem, a variety of general and application- 
specific spelling cor ... 

Keywords: n-gram analysis, Optical Character Recognition (OCR), context-dependent 
spelling correction, grammar checking, natural-language-processing models, neural net 
classifiers, spell checking, spelling error detection, spelling error patterns, statistical- 
language models, word recognition and correction 
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Spatial join finds pairs of spatial objects having a specific spatial relationship in spatial 
database systems. A number of spatial join algorithms have recently been proposed in the 
literature. Most of them, however, perform the join in the original space. Joining in the 
original space has a drawback of dealing with sizes of objects and thus has difficulty in 
developing a formal algorithm that does not rely on heuristics. In this paper, we propose a 
spatial join algorithm based on the tran ... 
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